: John Forbes Nash, Jr. 

Michael Th. Rassias Editors 
Open Problems 
in Mathematics 


g) Springer 


Open Problems in Mathematics 


John Forbes Nash, Jr. ¢ Michael Th. Rassias 
Editors 


Open Problems 
in Mathematics 


D Springer 


Editors 


John Forbes Nash, Jr. (Deceased) Michael Th. Rassias 

Department of Mathematics Department of Mathematics 

Princeton University Princeton University 

Princeton, NJ, USA Princeton, NJ, USA 
ETH-Ziirich 


Department of Mathematics 
Ziirich, Switzerland 


ISBN 978-3-319-32160-8 ISBN 978-3-319-32162-2 (eBook) 
DOI 10.1007/978-3-319-32162-2 


Library of Congress Control Number: 2016941333 


© Springer International Publishing Switzerland 2016 

This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of 
the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, 
broadcasting, reproduction on microfilms or in any other physical way, and transmission or information 
storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology 
now known or hereafter developed. 

The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication 
does not imply, even in the absence of a specific statement, that such names are exempt from the relevant 
protective laws and regulations and therefore free for general use. 

The publisher, the authors and the editors are safe to assume that the advice and information in this book 
are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or 
the editors give a warranty, express or implied, with respect to the material contained herein or for any 
errors or omissions that may have been made. 


Printed on acid-free paper 


This Springer imprint is published by Springer Nature 
The registered company is Springer International Publishing AG Switzerland 


Preface 


John Forbes Nash, Jr. and Michael Th. Rassias 


Learn from yesterday, live for today, hope for tomorrow. 
The important thing is not to stop questioning. 


— Albert Einstein (1879-1955) 


It has become clear to the modern working mathematician that no single researcher, 
regardless of his knowledge, experience, and talent, is capable anymore of overview- 
ing the major open problems and trends of mathematics in its entirety. The breadth 
and diversity of mathematics during the last century has witnessed an unprecedented 
expansion. 

In 1900, when David Hilbert began his celebrated lecture delivered before the 
International Congress of Mathematicians in Paris, he stoically said: 


Who of us would not be glad to lift the veil behind which the future lies hidden; to cast a 
glance at the next advances of our science and at the secrets of its development during future 
centuries? What particular goals will there be toward which the leading mathematical spirits 
of coming generations will strive? What new methods and new facts in the wide and rich 
field of mathematical thought will the new centuries disclose? 


Perhaps Hilbert was among the last great mathematicians who could talk about 
mathematics as a whole, presenting problems which covered most of its range at 
the time. One can claim this, not because there will be no other mathematicians 
of Hilbert’s caliber, but because life is probably too short for one to have the 
opportunity to expose himself to the allness of the realm of modern mathematics. 
Melancholic as this thought may sound, it simultaneously creates the necessity and 
aspiration for intense collaboration between researchers of different disciplines. 
Thus, overviewing open problems in mathematics has nowadays become a task 
which can only be accomplished by collective efforts. 

The scope of this volume is to publish invited survey papers presenting the status 
of some essential open problems in pure and applied mathematics, including old 
and new results as well as methods and techniques used toward their solution. One 
expository paper is devoted to each problem or constellation of related problems. 
The present anthology of open problems, notwithstanding the fact that it ranges 
over a variety of mathematical areas, does not claim by any means to be complete, 
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as such a goal would be impossible to achieve. It is, rather, a collection of 
beautiful mathematical questions which were chosen for a variety of reasons. Some 
were chosen for their undoubtable importance and applicability, others because 
they constitute intriguing curiosities which remain unexplained mysteries on the 
basis of current knowledge and techniques, and some for more emotional reasons. 
Additionally, the attribute of a problem having a somewhat vintage flavor was also 
influential in our decision process. 

The book chapters have been contributed by leading experts in the corresponding 
fields. We would like to express our deepest thanks to all of them for participating 
in this effort. 


Princeton, NJ, USA John F. Nash, Jr. 


April, 2015 
foln. §: Vor, OF 
Michael Th. Rassias 
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A Farewell to “A Beautiful Mind and a Beautiful 
Person’’ 


Michael Th. Rassias 


Having found it very hard to resign myself to John F. Nash’s sudden and so tragic 
passing, I postponed writing my commemorative addendum to our jointly composed 
preface until this compilation of papers on open problems was almost fully ready for 
publication. Now that I have finally built up my courage for coming to terms with 
John Nash’s demise, my name, which joyfully adjoins his at the end of the above 
preface, now also stands sadly alone below the following bit of reminiscence from 
my privileged year as his collaborator and frequent companion. 

It all started in September 2014, in one of the afternoon coffee/tea meetings 
that take place on a daily basis in the common room of Fine Hall, the building 
housing the Mathematics Department of Princeton University. John Nash silently 
entered the room, poured himself a cup of decaf coffee and then sat alone in a chair 
close by. That was when I first approached him and had a really pleasant chat about 
problems in the interplay of game theory and number theory. From that day onwards, 
our discussions became ever more frequent, and we eventually decided to prepare 
this volume Open Problems in Mathematics. The day we made this decision, he 
turned to me and said with his gentle voice, “I don’t want to be just a name on the 
cover though. I want to be really involved.” After that, we met almost daily and 
discussed for several hours at a time, examining a vast number of open problems in 
mathematics ranging over several areas. During these discussions, it became even 
clearer to me that his way of thinking was very different from that of almost all 
other mathematicians I have ever met. He was thinking in an unconventional, most 
creative way. His quick and distinctive mind was still shining bright in his later 
years. 

This volume was practically almost ready before John and Alicia Nash left in 
May for Oslo, where he was awarded the 2015 Abel Prize from the Norwegian 
Academy of Science and Letters. We had even prepared the preface of this volume, 
which he was so much looking forward to see published. Our decision to include 
handwritten signatures, as well, was along the lines of the somewhat vintage flavor 
and style that he liked. 

John Nash was planning to write a brief article on an open problem in game 
theory, which was the only problem we had not discussed yet. He was planning 
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to prepare it and discuss about it after his trip to Oslo. Thus, he never got the 
opportunity to write it. On this note, and notwithstanding my ‘last-minute’ invi- 
tation, Professor Eric Maskin generously accepted to contribute a paper presenting 
an important open problem in cooperative game theory. 

With this opportunity, I would also like to say just a few words about the man 
behind the mathematician. In the famous movie A Beautiful Mind, which portrayed 
his life, he was presented as a really combative person. It is true that in his early 
years he might have been, having also to battle with the demons of his illness. 
Being almost 60 years younger than him, I had the chance to get acquainted with 
his personality in his senior years. All the people who were around him, including 
myself, can avow that he was a truly wonderful person. Very kind and disarmingly 
simple, as well as modest. This is the reason why, among friends at Princeton, I 
used to humorously say that the movie should have been called A Beautiful Mind 
and a Beautiful Person. What was certainly true though was the dear love between 
John and Alicia Nash, who together faced and overcame the tremendous challenges 
of John Nash’s life. It is somehow a romantic tragedy that fate bound them to even 
leave this life together. 

In history, one can say that among the mathematicians who have reached 
greatness, there are some—a selected few—who have gone beyond greatness to 
become legends. John Nash was one such legend. 

The contributors of papers and myself cordially dedicate this volume to the 
memory and rich mathematical legacy of John F. Nash, Jr. 


Princeton, NJ, USA Michael Th. Rassias 


Introduction 
John Nash: Theorems and Ideas 


Misha Gromov 


Nash was not building big theories, he did not attempt to dislodge old concepts 
and to promote new ones, he didn’t try to be paradoxical. 

Nash was solving classical mathematical problems, difficult problems, some- 
thing that nobody else was able to do, not even to imagine how to do it. 

His landmark theorem of 1956—one of the main achievements of mathematics 
of the twentieth century—reads: 

All Riemannian manifolds X can be realised as smooth submanifolds in Euclidean spaces 

IR‘, such that the smoothness class of the submanifold realising an X in R? equals that of 


the Riemannian metric g on X and where the dimension q of the ambient Euclidean space 
can be universally bounded in terms of the dimension of X.! 


And as far as C!-smooth isometric embeddings f : X > R4 are concerned, there 
is no constraint on the dimension of the Euclidean space except for what is dictated 
by the topology of X: 

Every C!-smooth n-dimensional submanifold Xo in R? for g > n+ 1 can be deformed (by a 


C!-isotopy) to a new C!-position such that the induced Riemannian metric on Xj becomes 
equal to a given g.” 


At first sight, these are natural classically looking theorems. But what Nash 
has discovered in the course of his constructions of isomeric embeddings is far 
from “classical”—it is something that brings about a dramatic alteration of our 
understanding of the basic logic of analysis and differential geometry. Judging from 


M. Gromov 
IHES, 36 route de Chartres, 41990 Bures-sur-Yvette, France 


e-mail: gromov @ihes.fr 

'This was proven in the 1956 paper for C’-smooth metrics, r = 3,4,..., oo; the existence of 
real analytic isometric embeddings of compact manifolds with real analytic Riemannian metrics 
to Euclidean spaces was proven by Nash in 1966. 

Nash proved this in his 1954 paper for g > n + 2, where he indicated that a modification of his 
method would allow g = n + | as well. This was implemented in a 1955 paper by Nico Kuiper. 
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the classical perspective, what Nash has achieved in his papers is as impossible as 
the story of his life. 

Prior to Nash, the following two heuristic principles, vaguely similar to the 
first and the second laws of thermodynamics, have been (almost?) unquestionably 
accepted by analysts: 


1. Conservation of Regularity. The smoothness of solutions f of a “natural” 
functional, in particular a differential, equation D(f) = g is determined by the 
equation itself but not by a particular class of functions f used for the proof of 
the existence of solutions. 

2. Increase of Irregularity. If some amount of regularity of potential solutions f of 
our equations has been lost, it cannot be recaptured by any “external means,” 
such as artificial smoothing of functions. 


Instances of the first principle can be traced to the following three Hilbert’s 
problems: 

5th: Continuous groups are infinitely differentiable, in fact, real analytic. 

19th: Solutions of “natural” elliptic PDE are real analytic. 

Also Hilbert’s formulation of his 13th problem on 


non-representability of “interesting” functions in many variables 
by superpositions of continuous functions in fewer variables 


is motivated by this principle: 
continuous <> real analytic 


as far as superpositions of functions are concerned. 

Nash C!-isometric embedding theorem shattered the conservation of regularity 
idea: the system of differential equations that describes isometric immersions f : 
X — R¢ may have no analytic or not even C?-smooth solution f. 

But, according to Nash’s 1954 theorem, if gq > dim(X), and if X is diffeomorphic, 
to R", n < q, or to the n-sphere, then, no matter what Riemannian metric g you are 
given on this X, there are lots of isometric C!-embeddings X > R1.° 

Now, look at an equally incredible Nash’s approach to more regular, say C°°- 
smooth, isometric embeddings. The main lemma used by Nash, his implicit (or 
inverse) function theorem, may seem “classical” unless you read the small print: 

Let 9 : F > G be a C™-smooth non-linear differential operator between spaces 
F and G of C™-sections of two vector bundles over a manifold X. 


3In the spirit of Nash but probably independently, the continuous <> real analytic equivalence for 
superpositions of functions was disproved by Kolmogorov in 1956; yet, in essence, Hilbert’s 13th 
problem remains unsolved: are there algebraic (or other natural) functions in many variables that 
are not superpositions of real analytic functions in two variables? 

Also, despite an enormous progress, “true” Hilbert’s 19th problem remains widely open: what 
are possible singularities of solutions of elliptic PDE systems, such as minimal subvarieties and 
Einstein manifolds. 
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If the linearization 2 = Y%(f) of F at a point fo € F is invertible at 
80 = B(fo) € G by a differential operator linear in g, say M = -Mj,(g), then 
Q is also invertible (by a nonlinear non-differential operator) in a (small fine) C® - 
neighborhood of A(fo) € G. 

You must be a novice in analysis or a genius like Nash to believe anything like 
that can be ever true and/or to have a single nontrivial application. 

First of all, who has ever seen inversions of differential operators again by 
differential ones? 

And second of all, how on earth can Y be inverted by means of .@ when both 
operators, being differential, increase irregularity? 

But Nash writes down a simple formula for a linearized inversion .@ for 
the metric inducing operator Y, and he suggests a compensation for the loss of 
regularity by the use of smoothing operators. 

The latter may strike you as realistic as a successful performance of perpetuum 
mobile with a mechanical implementation of Maxwell’s demon...unless you start 
following Nash’s computation and realize to your immense surprise that the 
smoothing does work in the hands of John Nash. 

This, combined with a few ingenious geometric constructions, leads to C°™°- 
smooth isometric embeddings f : X > R¢ for g = 3n?/2 + O(n’), n = dim(X). 

Besides the above, Nash has proved a few other great theorems, but it is his work 
on isometric immersions that opened a new world of mathematics that stretches in 
front of our eyes in yet unknown directions and still waits to be explored. 


Scott Aaronson 


Abstract In 1950, John Nash sent a remarkable letter to the National Security 
Agency, in which—seeking to build theoretical foundations for cryptography—he 


all but formulated what today we call the P = NP problem, and consider one of 
the great open problems of science. Here I survey the status of this problem in 
2016, for a broad audience of mathematicians, scientists, and engineers. I offer a 
personal perspective on what it’s about, why it’s important, why it’s reasonable 
to conjecture that P # NP is both true and provable, why proving it is so hard, 
the landscape of related problems, and crucially, what progress has been made in 
the last half-century toward solving those problems. The discussion of progress 
includes diagonalization and circuit lower bounds; the relativization, algebrization, 
and natural proofs barriers; and the recent works of Ryan Williams and Ketan 
Mulmuley, which (in different ways) hint at a duality between impossibility proofs 
and algorithms. 


1 Introduction 


Now my general conjecture is as follows: for almost all sufficiently complex types of 
enciphering, especially where the instructions given by different portions of the key interact 
complexly with each other in the determination of their ultimate effects on the enciphering, 
the mean key computation length increases exponentially with the length of the key, or in 
other words, the information content of the key ... The nature of this conjecture is such that 
I cannot prove it, even for a special type of ciphers. Nor do I expect it to be proven.—John 
Nash, 1950 [171] 


In 1900, David Hilbert challenged mathematicians to design a “purely mechan- 
ical procedure” to determine the truth or falsehood of any mathematical statement. 


For a more recent version of this survey, with numerous corrections and revisions as well as new 
material, the reader is invited to visit http://www.scottaaronson.com/papers/pvsnp.pdf 
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That goal turned out to be impossible. But the question—does such a procedure 
exist, and why or why not?—helped launch three related revolutions that shaped the 
twentieth century: one in math and science, as reasoning itself became a subject of 
mathematical analysis; one in philosophy, as the results of Gédel, Church, Turing, 
and Post showed the limits of formal systems and the power of self-reference; and 
one in technology, as the electronic computer achieved, not all of Hilbert’s dream, 
but enough of it to change the daily experience of most people on earth. 


The P = NP problem is a modern refinement of Hilbert’s 1900 question. The 
problem was explicitly posed in the early 1970s in the works of Cook and Levin, 
though versions were stated earlier—including by Gédel in 1956, and as we see 


above, by John Nash in 1950. In plain language, P = NP asks whether there’s 
a fast procedure to answer all questions that have short answers that are easy to 
verify mechanically. Here one should think of a large jigsaw puzzle with (say) 10!0° 
possible ways of arranging the pieces, or an encrypted message with a similarly 
huge number of possible decrypts, or an airline with astronomically many ways of 
scheduling its flights, or a neural network with millions of weights that can be set 
independently. All of these examples share two key features: 


(1) a finite but exponentially-large space of possible solutions; and 

(2) a fast, mechanical way to check whether any claimed solution is “valid.” (For 
example, do the puzzle pieces now fit together in a rectangle? Does the proposed 
airline schedule achieve the desired profit? Does the neural network correctly 
classify the images in a test suite?) 


The P = NP question asks whether, under the above conditions, there’s a general 
method to find a valid solution whenever one exists, and which is enormously faster 
than just trying all the possibilities one by one, from now till the end of the universe, 
like in Jorge Luis Borges’ Library of Babel. 

Notice that Hilbert’s goal has been amended in two ways. On the one hand, 
the new task is “easier” because we’ve restricted ourselves to questions with only 
finitely many possible answers, each of which is easy to verify or rule out. On the 
other hand, the task is “harder” because we now insist on a fast procedure: one that 
avoids the exponential explosion inherent in the brute-force approach. 

Of course, to discuss such things mathematically, we need to pin down the 
meanings of “fast” and “mechanical” and “easily checked.’ As we’ll see, the 


P = NP question corresponds to one natural choice for how to define these 
concepts, albeit not the only imaginable choice. For the impatient, P stands for 
“Polynomial Time,” and is the class of all decision problems (that is, infinite sets of 
yes-or-no questions) solvable by a standard digital computer—or for concreteness, 
a Turing machine—using a polynomial amount of time. By this, we mean a number 
of elementary logical operations that is upper-bounded by the bit-length of the input 
question raised to some fixed power. Meanwhile, NP stands for ““Nondeterministic 
Polynomial Time,” and is the class of all decision problems for which, if the answer 
is “yes,” then there is a polynomial-size proof that a Turing machine can verify in 
polynomial time. It’s immediate that P C NP, so the question is whether this con- 
tainment is proper (and hence P 4 NP), or whether NP C P (and hence P = NP). 


P= NP 3 
1.1 The Importance of P = NP 


Before getting formal, it seems appropriate to say something about the significance 


of the P = NP question. P 2 NP, we might say, shares with Hilbert’s original 
question the character of a “math problem that’s more than a math problem”: a 
question that reaches inward to ask about mathematical reasoning itself, and also 
outward to everything from philosophy to natural science to practical computation. 

To start with the obvious, essentially all the cryptography that we currently use 
on the Internet—for example, for sending credit card numbers—would be broken if 
P = NP (and if, moreover, the algorithm were efficient in practice, a caveat we’ll 


return to later). Though he was writing 21 years before P = NP was explicitly 
posed, this is the point Nash was making in the passage with which we began. 

The reason is that, in most cryptography, the problem of finding the decryption 
key is an NP search problem: that is, we know mathematically how to check whether 
a valid key has been found. The only exceptions are cryptosystems like the one- 
time pad and quantum key distribution, which don’t rely on any computational 
assumptions (but have other disadvantages, such as the need for huge pre-shared 
keys or for special communication hardware). 


The metamathematical import of P = NP was also recognized early. It was 
articulated, for example, in Kurt Gédel’s now-famous 1956 letter to John von 


? 
Neumann, which sets out what we now call the P = NP question. Gédel wrote: 


If there actually were a machine with [running time] ~ Kn (or even only with ~ Kn’) 
[for some constant K independent of n], this would have consequences of the greatest 
magnitude. That is to say, it would clearly indicate that, despite the unsolvability of the 
Entscheidungsproblem, the mental effort of the mathematician in the case of yes-or-no 
questions could be completely [added in a footnote: apart from the postulation of axioms] 
replaced by machines. One would indeed have to simply select an n so large that, if the 
machine yields no result, there would then also be no reason to think further about the 
problem. 


Expanding on Gédel’s observation, some modern commentators have explained 


the importance of P = NP as follows. It’s well-known that P = NP is one of the 
seven Clay Millennium Problems (alongside the Riemann Hypothesis, the Yang- 
Mills mass gap, etc.), for which a solution commands a million-dollar prize [66]. 


But even among those problems, P = NP has a special status. For if someone 
discovered that P = NP, and if moreover the algorithm was efficient in practice, 
that person could solve not merely one Millennium Problem but all seven of them— 
for she’d simply need to program her computer to search for formal proofs of the 
other six conjectures.’ Of course, if (as most computer scientists believe) P # NP, 


‘Here we’re using the observation that, once we fix a formal system (say, first-order logic plus the 
axioms of ZF set theory), deciding whether a given statement has a proof at most n symbols long 
in that system is an NP problem, which can therefore be solved in time polynomial in n assuming 
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a proof of that would have no such world-changing implications, but even the fact 
that such a proof could rule out those implications underscores the enormity of what 
we're asking. 

I should be honest about the caveats. While theoretical computer scientists 
(including me!) have not always been above poetic flourish, P = NP is not 
quite equivalent to the questions of “whether human creativity can be automated,” 
or “whether anyone who can appreciate a symphony is Mozart, anyone who can 
recognize a great novel is Jane Austen.” Apart from the obvious point that no purely 
mathematical question could fully capture these imponderables, there are also more 
specific issues. 


For one thing, while P = NP has tremendous relevance to artificial intelligence, 
it says nothing about the differences, or lack thereof, between humans and machines. 
Indeed, P ~ NP would represent a limitation on all classical digital computation, 
one that might plausibly apply to human brains just as well as to electronic 
computers. Nor does P # NP rule out the possibility of robots taking over the 
world. To defeat humanity, presumably the robots wouldn’t need to solve arbitrary 
NP problems in polynomial time: they’d merely need to be smarter than us, and to 
have imperfect heuristics better than the imperfect heuristics that we picked up from 
a billion years of evolution! Conversely, while a proof of P = NP might hasten 


a robot uprising, it wouldn’t guarantee one. For again, what P = NP asks is not 
whether all creativity can be automated, but only that creativity whose fruits can be 
quickly checked by computer programs that we know how to write. 

To illustrate, suppose we wanted to program a computer to create new Mozart- 
quality symphonies and Shakespeare-quality plays. If P = NP, and the algorithm 
were efficient in practice, then that really would imply that these feats could be 
reduced to a seemingly-easier problem, of programming a computer to recognize 
such symphonies and plays when given them. And interestingly, P = NP might 
also help with the recognition problem: for example, by letting us train a neural 
network that reverse-engineered the expressed aesthetic preferences of hundreds of 
human experts. But how well that neural network would perform is an empirical 
question outside the scope of mathematics. 


1.2 Objections to P = NP 


After modest exposure to the P = NP problem, many people come up with what 
they consider an irrefutable objection to its phrasing or importance. Since the same 
objections tend to recur, in this section Ill collect the most frequent ones and make 
some comments about them. 


P = NP. We’re also assuming that the other six Clay conjectures have ZF proofs that are not 
too enormous: say, 10!? symbols or fewer, depending on the exact running time of the assumed 
algorithm. In the case of the Poincaré Conjecture, this can almost be taken to be a fact, modulo the 
translation of Perelman’s proof [179] into the language of ZF. 
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1.2.1 The Asymptotic Objection 


Objection P = NP talks only about asymptotics—i.e., whether the running time 
of an algorithm grows polynomially or exponentially with the size n of the question 
that was asked, as n goes to infinity. It says nothing about the number of steps needed 
for concrete values of n (say, a thousand or a million), which is all anyone would 
ever care about in practice. 


Response It was realized early in the history of computer science that “number of 
steps” is not a robust measure of hardness, because it varies too wildly from one 
machine model to the next (from Macs to PCs and so forth), and also depends 
heavily on low-level details of how the problem is encoded. The asymptotic 
complexity of a problem could be seen as that contribution to its hardness that 
is clean and mathematical, and that survives the vicissitudes of technology. Of 
course, real-world software design requires thinking about many non-asymptotic 
contributions to a program’s efficiency, from compiler overhead to the layout of the 
cache (as well as many considerations that have nothing to do with efficiency at all). 
But any good programmer knows that the asymptotics matter as well. 

More specifically, many people object to theoretical computer science’s equation 
of “polynomial” with “efficient” and “exponential” with “inefficient,” given that 
for any practical value of n, an algorithm that takes 1.0000001” steps is clearly 
preferable to an algorithm that takes n!°° steps. This would be a strong objection, 
if such algorithms were everyday phenomena. Empirically, however, computer 
scientists found that there is a strong correlation between “solvable in polynomial 
time” and “solvable efficiently in practice,” with most (but not all) problems in P that 
they care about solvable in linear or quadratic or cubic time, and most (but not all) 
problems outside P that they care about requiring c” time via any known algorithm, 
for some c significantly larger than 1. Furthermore, even when the first polynomial- 
time algorithm discovered for some problem takes (say) n° or n!° time, it often 
happens that later advances lower the exponent, or that the algorithm runs much 
faster in practice than it can be guaranteed to run in theory. This is what happened, 
for example, with linear programming, primality testing, and Markov Chain Monte 
Carlo algorithms. 

Having said that, of course the goal is not just to answer some specific question 


5 
like P = NP, but to learn the truth about efficient computation, whatever it might 
be. If practically-important NP problems turn out to be solvable in n!° time but 
not in n°”? time, or in 1.0000001” time, then so be it. From this perspective, one 


2 : : z 
could argue that P = NP simply serves as a marker of ignorance: in effect we are 
saying, “if we can’t even answer this, then surely we can’t answer the more refined 
questions either.” 
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1.2.2. The Polynomial-Time Objection 


Objection But why should we draw the border of efficiency at the polynomial 
functions, as opposed to any other class of functions—for example, functions upper- 
bounded by n?, or functions of the form n°” (called quasipolynomial functions)? 


Response There is a good theoretical answer to this: it’s because polynomials 
are the smallest class of functions that contain the linear functions, and that are 
closed under basic operations like addition, multiplication, and composition. For 
this reason, they are the smallest class that ensures that we can compose “efficient 
algorithms” a constant number of times, and still get an algorithm that is efficient 
overall. For the same reason, polynomials are also the smallest class that ensures that 
our “set of efficiently solvable problems” is independent of the low-level details of 
the machine model. 

Having said that, much of algorithms research is about lowering the order of 
the polynomial, for problems already known to be in P; and theoretical computer 
scientists do use looser notions like quasipolynomial time whenever they are needed. 


1.2.3 The Kitchen-Sink Objection 


Objection P = NP is limited, because it talks only about discrete, deterministic 
algorithms that find exact solutions in the worst case—and also, because it ignores 
the possibility of natural processes that might exceed the limits of Turing machines, 
such as analog computers, biological computers, or quantum computers. 


Response For every assumption mentioned above, there is now a major branch 
of theoretical computer science that studies what happens when one relaxes 
the assumption: for example, randomized algorithms, approximation algorithms, 
average-case complexity, and quantum computing. I'll discuss some of these 
branches in Sect.5. Briefly, though, there are deep reasons why many of these 


ideas are thought to leave the original P = NP problem in place. For example, 
according to the P = BPP conjecture (see Sect. 5.4.1), randomized algorithms yield 
no more power than P, while careful analyses of noise, energy expenditure, and the 
like suggest that the same is true for analog computers (see [3]). Meanwhile, the 
famous PCP Theorem and its offshoots (see Sect. 3) have shown that, for many NP 
problems, there cannot even be a polynomial-time algorithm to approximate the 
answer to within a reasonable factor, unless P = NP. 

In other cases, new ideas have led to major, substantive strengthenings of the P # 
NP conjecture (see Sect. 5): for example, that there exist NP problems that are hard 
even on random inputs, or hard even for a quantum computer. Of course, proving 
P + NP itself is a prerequisite to proving any of these strengthened versions. 

There’s one part of this objection that’s so common that it requires some separate 
comments. Namely, people will say that even if P # NP, in practice we can 
find almost always find good enough solutions to the problems we care about, for 
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example by using heuristics like simulated annealing or genetic algorithms, or by 
using special structure or symmetries in real-life problem instances. 

Certainly there are cases where this assumption is true. But there are also 
cases where it’s false: indeed, the entire field of cryptography is about making the 
assumption false! In addition, I believe our practical experience is biased by the fact 
that we don’t even try to solve search problems that we “know” are hopeless—yet 
that wouldn’t be hopeless in a world where P = NP (and where the algorithm was 
efficient in practice). For example, presumably no one would try using brute-force 
search to look for a formal proof of the Riemann Hypothesis one billion lines long or 
shorter, or a 10-megabyte program that reproduced most of the content of Wikipedia 
within a reasonable time (possibly needing to encode many of the principles of 
human intelligence in order to do so). Yet both of these are “merely” NP search 
problems, and things one could seriously contemplate in a world where P = NP. 


1.2.4 The Mathematical Snobbery Objection 


7 
Objection P = NP is not a “real” math problem, because it talks about 
Turing machines, which are arbitrary human creations, rather than about “natural” 
mathematical objects like integers or manifolds. 


Response The simplest reply is that P = NP is not about Turing machines at all, 
but about algorithms, which seem every bit as central to mathematics as integers 
or manifolds. Turing machines are just one particular formalism for expressing 
algorithms, as the Arabic numerals are one particular formalism for integers. And 
crucially, just like the Riemann Hypothesis is still the Riemann Hypothesis in base- 
17 arithmetic, so essentially every formalism for deterministic digital computation 
ever proposed gives rise to the same complexity classes P and NP, and the same 
question about whether they are equal. (This observation is known as the Extended 
Church-Turing Thesis.) 

This objection might also reflect lack of familiarity with recent progress in 
complexity theory, which has drawn on Fourier analysis, arithmetic combinatorics, 
representation theory, algebraic geometry, and dozens of other subjects about 
which yellow books are written. Furthermore, in Sect.6.6, we’ll see Geometric 
Complexity Theory (GCT), a breathtakingly ambitious program for proving P # 
NP that throws almost the entire arsenal of modern mathematics at the problem, 
including geometric invariant theory, plethysms, quantum groups, and Langlands- 
type correspondences. Regardless of whether GCT’s specific conjectures pan out, 
they illustrate in detail how progress toward proving P 4 NP will plausibly involve 
deep insights from many parts of mathematics. 


1.2.5 The Sour Grapes Objection 


j 
Objection P = NP is so hard that it’s impossible to make anything resembling 
progress on it, at least at this stage in human history—and for that reason, it’s 


8 S. Aaronson 


unworthy of serious effort or attention. Indeed, we might as well treat such questions 
as if their answers were formally independent of set theory, as for all we know they 
are (a possibility discussed further in Sect. 3.1). 


Response One of the main purposes of this survey is to explain what we know 


now, relevant to the P ES NP problem, that we didn’t know 10 or 20 or 30 
years ago. It’s true that, if “progress” entails having a solution already in sight, 
or being able to estimate the time to a solution, I know of no progress of that 
kind! But by the same standard, one would have to say there was no “progress” 
toward Fermat’s Last Theorem in 1900—even as mathematicians, partly motivated 
by Fermat’s problem, were laying foundations of algebraic number theory that did 
eventually lead to Wiles’s proof. In this survey, Pll try to convey how, over the last 
few decades, insights about circuit lower bounds, relativization and arithmetization, 
pseudorandomness and natural proofs, the “duality” between lower bounds and 
algorithms, the permanent and determinant manifolds, and more have transformed 
our understanding of what a P ~ NP proof could look like. 


I should point out that, even supposing P = NP is never solved, it’s already 
been remarkably fruitful as an “aspirational” or “flagship” question, helping to 
shape research in algorithms, cryptography, learning theory, derandomization, 
quantum computing, and other things that theoretical computer scientists work 
on. Furthermore, later we’ll see examples of how seemingly-unrelated progress in 
some of those other areas, unexpectedly ended up tying back to the quest to prove 
P ~ NP. 


1.2.6 The Obviousness Objection 


Objection It is intuitively obvious that P # NP. For that reason, a proof of 
P # NP—confirming that indeed, we can’t do something that no reasonable person 
would ever have imagined we could do—gives almost no useful information. 


Response This objection is perhaps less common among mathematicians than 
others, since were it upheld, it would generalize to almost all of mathematics! Like 
with most famous unsolved math problems, the quest to prove P # NP is “less 
about the destination than the journey”: there might or might not be surprises in the 
answer itself, but there will certainly be huge surprises (indeed, there already have 
been huge surprises) along the way. More concretely: to make a sweeping statement 
like P # NP, about what polynomial-time algorithms can’t do, will require an 
unprecedented understanding of what they can do. This will almost certainly entail 
the discovery of many new polynomial-time algorithms, some of which could 
have practical relevance. In Sect.6, we will see many more subtle examples of 
the “duality” between algorithms and impossibility proofs, with progress on each 
informing the other. 

Of course, to whatever extent you regard P = NP as a live possibility, the 
Obviousness Objection is not open to you. 
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1.2.7 The Constructivity Objection 


Objection Even if P = NP, the proof could be nonconstructive—in which case it 
wouldn’t have any of the amazing implications discussed in Sect. 1.1, because we 
wouldn’t know the algorithm. 


Response A nonconstructive proof that an algorithm exists is indeed a theoretical 
possibility, though one that has reared its head only a few times in the history of 
computer science.” Even then, however, once we knew that an algorithm existed, we 
would have a massive inducement to try to find it. The same is true if, for example, 
the first proof of P = NP only gave an n!°° algorithm, but we suspected that an n? 
algorithm existed.* 


1.3. Further Reading 


There were at least four previous major survey articles about P = NP: Michael 
Sipser’s 1992 “The History and Status of the P versus NP Question” [207]; 
Stephen Cook’s 2000 “The P versus NPProblem” [66], which was written for 
the announcement of the Clay Millennium Prize; Avi Wigderson’s 2006 “P, 


?The most celebrated examples of nonconstructive proofs that algorithms exist all come from 
the Robertson-Seymour graph minors theory, one of the great achievements of twentieth-century 
combinatorics (for an accessible introduction, see for example Fellows [75]). The Robertson- 
Seymour theory typically deals with parameterized problems: for example, “given a graph G, 
decide whether G can be embedded on a sphere with k handles.” In those cases, typically a 
fast algorithm A; can be abstractly shown to exist for every value of k. The central problem is 
that each A, requires hard-coded data—in the above example, a finite list of obstructions to the 
desired embedding—that no one knows how to find given k, and whose size might also grow 
astronomically as a function of k. On the other hand, once the finite obstruction set for a given k 


was known, one could then use it to solve the problem for any graph G in time O (ici’), where 


the constant hidden by the big-O depended on k. 

Robertson-Seymour theory also provides a few examples of non-parameterized problems that 
are abstractly proved to be in P but with no bound on the exponent, or abstractly proved to be 
O (n?) or O (n’) but with no bound on the constant. Thus, one cannot rule out the possibility that 
the same would happen with an NP-complete problem, and Donald Knuth [131] has explicitly 
speculated that P = NP will be proven in that way. To me, however, it is unclear whether he 
speculates this because there is a positive reason for thinking it true, or just because it would be 
cool and interesting if it was true. 


3As an amusing side note, there is a trick called Levin’s universal search [141], in which one 
“dovetails” over all Turing machines M,,Mo,... (that is, for all t, runs Mj,..., M,; for t steps 
each), halting when and if any ©; has outputs a valid solution to one’s NP search problem. If 
we know P = NP, then we know this particular algorithm will find a valid solution, whenever 
one exists, in polynomial time—because clearly some M; does so, and all the machines other than 
M; increase the total running time by “only” a polynomial factor! With more work, one can even 
decrease this to a constant factor. Admittedly, however, the polynomial or constant factor will be 
so enormous as to negate this algorithm’s practical use. 
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NP, and Mathematics—A Computational Complexity Perspective”[232]; and Eric 
Allender’s 2009 “A Status Report on the P versus NP Question” [21]. All four 
are excellent, so it’s only with trepidation that I add another entry to the crowded 
arena. I hope that, if nothing else, this survey shows how much has continued to 
occur. I cover several major topics that either didn’t exist a decade ago, or existed 
only in much more rudimentary form: for example, the algebrization barrier, “ironic 
complexity theory” (including Ryan Williams’s NEXP ¢ ACC result), the “chasm 
at depth three” for the permanent, and the Mulmuley-Sohoni Geometric Complexity 
Theory program. 


The seminal papers that set up the intellectual framework for P 2 NP, posed 
it, and demonstrated its importance include those of Edmonds [74], Cobham [65], 
Cook [67], Karp [122], and Levin [141]. See also Trakhtenbrot [223] for a survey 
of Soviet thought about perebor, as brute-force search was referred to in Russian in 
the 1950s and 60s. 

The classic text that introduced the wider world to P, NP, and NP-completeness, 
and that gave a canonical (and still-useful) list of hundreds of NP-complete prob- 
lems, is Garey and Johnson [86]. Some recommended computational complexity 
theory textbooks—in rough order from earliest to most recent, in the material they 
cover—are Sipser [208], Papadimitriou [175], Sch6ning [199], Moore and Mertens 
[155], and Arora and Barak [27]. Surveys on particular aspects of complexity theory 
will be recommended where relevant throughout the survey. 


Those seeking a nontechnical introduction to P = NP might enjoy Lance 
Fortnow’s charming book The Golden Ticket [80], or his 2009 popular article for 
Communications of the ACM [79]. My own Quantum Computing Since Democritus 
[6] gives something between a popular and a technical treatment. 


2 Formalizing P = NP and Central Related Concepts 


The P = NP problem is normally phrased in terms of Turing machines: a theoretical 
model of computation proposed by Alan Turing in 1936, which involves a one- 
dimensional tape divided into discrete squares, and a finite control that moves back 
and forth on the tape, reading and writing symbols. For a formal definition, see, e.g., 
Sipser [208] or Cook [66]. 

In this survey, I won’t define Turing machines, for the simple reason that if you 
know any programming language—C, Java, Python, etc.—then you already know 
something that’s equivalent to Turing machines for our purposes. More precisely, 
the Church-Turing Thesis holds that virtually any model of digital computation one 
can define will be equivalent to Turing machines, in the sense that Turing machines 
can simulate that model and vice versa. A modern refinement, the Extended Church- 
Turing Thesis, says that moreover, these simulations will incur at most a polynomial 
overhead in time and memory. If we accept this, then there’s a well-defined notion 
of “solvable in polynomial time by a digital computer,’ which is independent of 


P= NP 11 


the low-level details of the computer’s architecture: the instruction set, the rules for 
accessing memory, etc. This licenses us to ignore those details. The main caveats 
here are that 


(1) the computer must be classical, discrete, and deterministic (it’s not a quantum 
computer, an analog device, etc., nor can it call a random-number generator or 
any other external resource), and 

(2) there must be no a-priori limit on how much memory the computer can address, 
even though any program that runs for finite time will only address a finite 
amount of memory.*:> 


We can now define P and NP, in terms of Turing machines for concreteness— 
but, because of the Extended Church-Turing Thesis, the reader is free to substitute 
other computing formalisms such as Lisp programs, A-calculus, stylized assembly 
language, or cellular automata. 

A language is a set of binary strings, L C {0,1}*, where {0, 1}* is the set of 
all binary strings of all (finite) lengths. Of course a language can be infinite, even 
though every string in the language is finite. One example is the language consisting 
of all palindromes: for instance, 00, 11, 0110, 11011, etc., but not 001 or 1100. A 
more interesting example is the language consisting of all binary encodings of prime 
numbers: for instance, 10, 11, 101, and 111, but not 100. 

A binary string x € {0, 1}*, for which we want to know whether x € L, is called 
an instance of the general problem of deciding membership in L. Given a Turing 
machine M and an instance x, we let M (x) denote M run on input x (say, on a tape 
initialized to --- O#x#0---, or x surrounded by delimiters and blank or 0 symbols). 
We say that M (x) accepts if it eventually halts and enters an “accept” state, and we 
say that M decides the language L if for all x € {0, 1}*, 


xeL => M(x)accepts. 


The machine M may also contain a “reject” state, which M enters to signify that 
it has halted without accepting. Let |x| be the length of x (i.e., the number of bits). 
Then we say M is polynomial-time if there exists a polynomial p such that M (x) 
halts, either accepting or rejecting, after at most p (|x|) steps, for all x € {0, 1}*. 


“The reason for this caveat is that, if a programming language were inherently limited to (say) 
64K of memory, there would be only finitely many possible program behaviors, so in principle we 
could just cache everything in a giant lookup table. Many programming languages do impose a 
finite upper bound on the addressable memory, but they could easily be generalized to remove this 
restriction (or one could consider programs that store information on external I/O devices). 

5] should stress that, once we specify which computational models we have in mind—Turing 
machines, Intel machine code, etc.—the polynomial-time equivalence of those models is typically 
a theorem, though a rather tedious one. The “thesis” of the Extended Church-Turing Thesis, the 
part not susceptible to proof, is that all other “reasonable” models of digital computation will also 
be equivalent to those models. 
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Now, P, or Polynomial-Time, is the class of all languages L for which there 
exists a Turing machine M that decides L in polynomial time. Also, NP, or 
Nondeterministic Polynomial-Time, is the class of languages L for which there 
exists a Turing machine M, and a polynomial p, such that for all x € {0, 1}*, 


xEL => Awe {0, 1? 


M (x, w) accepts. 

In other words, NP is the class of languages L for which, whenever x € L, 
there exists a polynomial-size “witness string” w, which enables a polynomial-time 
“verifier” M to recognize that indeed x € L. Conversely, whenever x ¢ L, there must 
be no w that causes M (x, w) to accept. 

There is an earlier definition of NP, which explains its ungainly name. Namely, 
we can define a nondeterministic Turing machine as a Turing machine that “when 
it sees a fork in the road, takes it”: that is, that is allowed to transition from 
a single state at time ¢ to multiple possible states at time ¢ + 1. We say that a 
machine “accepts” its input x, if there exists a list of valid transitions between states, 
5S} —> S2 —> $3 — -:-, that the machine could make on input x that terminates in 
an accepting state S Accept. The machine “rejects” if there is no such accepting path. 
The “running time” of such a machine is the maximum number of steps taken along 
any path, until the machine either accepts or rejects. We can then define NP as the 
class of all languages L for which there exists a nondeterministic Turing machine 
that decides L in polynomial time. It is clear that NP, so defined, is equivalent to 
the more intuitive verifier definition that we gave earlier. In one direction, if we 
have a polynomial-time verifier M, then a nondeterministic Turing machine can 
create paths corresponding to all possible witness strings w, and accept if and only 
if there exists a w such that M (x, w) accepts. In the other direction, if we have a 
nondeterministic Turing machine M’, then a verifier can take as its witness string w 
a description of a claimed path that causes M’ (x) to accept, then check that the path 
indeed does so. 

Clearly P C NP, since an NP verifier M can just ignore its witness w, and try to 
decide in polynomial time whether x ¢€ L itself. The central conjecture is that this 
containment is strict. 


Conjecture 1. P # NP. 


2.1 NP-Completeness 


A further concept, not part of the statement of P = NP but central to any discussion 
of it, is NP-completeness. To explain this requires a few more definitions. An oracle 
Turing machine is a Turing machine that, at any time, can submit an instance x to 
an “oracle”: a device that, in a single time step, returns a bit indicating whether x 
belongs to some given language L. An oracle that answers all queries consistently 
with L is called an L-oracle, and we write M" to denote the (oracle) Turing machine 
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Fig. 1 P, NP, NP-hard, and 
NP-complete 


NP-hard 


NP-complete 


M with L-oracle. We can then define P”, or P relative to L, as the class of all 
languages L’ for which there exists an oracle machine M such that M* decides 
L’ in polynomial time. If L’ € P*”, then we also write L’ = L, which means 
“L’ is polynomial-time Turing-reducible to L.” Note that polynomial-time Turing- 
reducibility is indeed a partial order relation (i.e., it is transitive and reflexive). 

A language L is NP-hard (technically, NP-hard under Turing reductions®) if 
NP c P®. Informally, NP-hard means “at least as hard as any NP problem”: if we 
had a black box for an NP-hard problem, we could use it to solve all NP problems in 
polynomial time. Also, L is NP-complete if L is NP-hard and L € NP. Informally, 
NP-complete problems are the hardest problems in NP. (See Fig. 1.) 

A priori, it is not completely obvious that NP-hard or NP-complete problems 
even exist. The great discovery of theoretical computer science in the 1970s was 
that hundreds of problems of practical importance fall into these classes: indeed, 
what is unusual is to find a hard NP problem that is not NP-complete. 

More concretely, consider the following languages: 


¢ 3SAT is the language consisting of all encodings of Boolean formulas g over 
n variables, which consist of ANDs of “3-clauses” (i.e., ORs of up to three 
variables or their negations), such that there exists at least one assignment that 
satisfies p. Here is an example, for which one can check that there’s no satisfying 
assignment: 


KVYVZIARVIVIAAVYHNAEVYAOVIDA(GVZ) 


®In practice, often one only needs a special kind of Turing reduction called a many-one reduction 
or Karp reduction, which is a polynomial-time algorithm that maps every yes-instance of L’ to a 
yes-instance of L, and every no-instance of L’ to a no-instance of L. The additional power of Turing 
reductions—to make multiple queries to the L-oracle (with later queries depending on the outcomes 
of earlier ones), post-process the results of those queries, etc.—is needed only in a minority of 
cases. Nevertheless, for conceptual simplicity, throughout this survey I'll talk in terms of Turing 
reductions. 


S. Aaronson 


HAMILTONCYCLE is the language consisting of all encodings of undirected 
graphs, for which there exists a cycle that visits each vertex exactly once (a 
Hamilton cycle). 

TSP (Traveling Salesperson Problem) is the language consisting of all encodings 
of ordered pairs (G, k), such that G is a graph with positive integer weights, k is 
a positive integer, and G has a Hamilton cycle of total weight at most k. 

CLIQUE is the language consisting of all encodings of ordered pairs (G, k), such 
that G is an undirected graph, k is a positive integer, and G contains a clique with 
at least k vertices. 

SUBSETSUM is the language consisting of all encodings of positive integer tuples 
(a1,..., 4x, b), for which there exists a subset of the a;’s that sums to b. 

3COL is the language consisting of all encodings of undirected graphs G that are 
3-colorable (that is, the vertices of G can be colored red, green, or blue, so that 
no two adjacent vertices are colored the same) 


All of these languages are easily seen to be in NP. The famous Cook-Levin 


Theorem says that one of them—3SAT—is also NP-hard, and hence NP-complete. 


Theorem 2 (Cook-Levin Theorem [67, 141]). 3SAT is NP-complete. 


A proof of Theorem 2 can be found in any theory of computing textbook (for 


example, [208]). Here I'll confine myself to saying that Theorem 2 can be proved in 
three steps, each of them routine from today’s standpoint: 


(1) 


(2 


Ym 


(3) 


One constructs an artificial language that is “NP-complete essentially by 
definition”: for example, 


L= {((M) ,x, 0°, 0') : dw € {0, 1}° suchthatM (x, w) acceptsin < t steps} : 


where (M) is a description of the Turing machine M. 

One then reduces L to the CIRCUITSAT problem, where we are given as input 
a description of a Boolean circuit C built of AND, OR, and NOT gates, and 
asked whether there exists an assignment x € {0,1}" for the input bits such 
that C(x) = 1. To do that, in turn, is more like electrical engineering than 
mathematics: given a Turing machine M, one simply builds up a Boolean logic 
circuit that simulates the action of M on the input (x, w) for ¢ time steps, whose 
size is polynomial in the parameters |(M)|, |x|, s, and t, and which outputs 1 if 
and only if M ever enters its accept state. 

Finally, one reduces CIRCUITSAT to 3SAT, by creating a new variable for 
each gate in the Boolean circuit C, and then creating clauses to enforce that 
the variable for each gate G equals the AND, OR, or NOT (as appropriate) 
of the variables for G’s inputs. For example, one can express the constraint 
aNb=cby 


(ave) A(bVe)A (GV bVe). 
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One then constrains the variable for the final output gate to be 1, yielding a 
3SAT instance ¢ that is satisfiable if and only if the CIRCUITSAT instance was 
(i.e., iff there existed an x such that C (x) = 1). 


Note that the algorithms to reduce L to CIRCUITSAT and to 3SAT—i.e., to convert 
M to C and C to y—tun in polynomial time (actually linear time), so we do indeed 
preserve NP-hardness. Also, the reason for the 3 in 3SAT is simply that a Boolean 
AND or OR gate has one output bit and two input bits, so it relates three bits in total. 
The analogous 2SAT problem turns out to be in P. 

Once one knows that 3SAT is NP-complete, “the floodgates are open.” One can 
then prove that countless other NP problems are NP-complete by reducing 3SAT to 
them, and then reducing those problems to others, and so on. The first indication of 
how pervasiveness NP-completeness really was came from Karp [122] in 1972. He 
showed, among many other results: 


Theorem 3 (Karp [122]). HAMILTONCYCLE, TSP, CLIQUE, SUBSETSUM, and 
3COL are all NP-complete. 


Today, so many combinatorial search problems have been proven NP-complete 
that, whenever one encounters a new such problem, a useful rule of thumb is that 
it’s “NP-complete unless it has a good reason not to be”! 

Note that, if any NP-complete problem is in P, then all of them are, and P = NP. 
Conversely, if any NP-complete problem is not in P, then none of them are, and 
P ~ NP. 

One application of NP-completeness is to reduce the number of logical quanti- 
fiers needed to state the P 4 NP conjecture. Let PT be the set of all polynomial- 
time Turing machines, and given a language L, let L(x) = lif x ¢ Land L(x) = 0 
otherwise. Then a “naive” statement of P 4 NP would be 


ALE NPVMe PT AxM (x) LQ). 
(Here, by quantifying over all languages in NP, we really mean quantifying over all 
verification algorithms that define such languages.) Once we know that 3SAT (for 
example) is NP-complete, we can state P # NP as simply: 


VM € PT Ax M (x) # 3Sat (x). 


In words, we can pick any NP-complete problem we like; then P 4 NP is equivalent 
to the statement that that problem is not in P. 


2.2 Other Core Concepts 


? 
A few more concepts give a fuller picture of the P = NP question, and will be 
referred to later in the survey. In this section, we restrict ourselves to concepts that 


were explored in the 1970s, around the same time as P af NP itself was formulated, 
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5 
and that are covered alongside P = NP in any undergraduate textbook. Other 
important concepts, such as nonuniformity, randomness, and one-way functions, 
will be explained as needed in Sect. 5. 


2.2.1. Search, Decision, and Optimization 


For technical convenience, P and NP are defined in terms of languages or “decision 
problems,” which have a single yes-or-no bit as the desired output (i.e., given an 
input x, is x € L?). To put practical problems into this decision format, typically 
we ask something like: does there exist a solution that satisfies the following list 
of constraints? But of course, in real life we don’t merely want to know whether 
a solution exists; we want to find a solution whenever there is one! And given the 
many examples in mathematics where explicitly finding an object is harder than 
proving its existence, one might worry that this would also occur here. Fortunately, 
though, shifting our focus from decision problems to search problems doesn’t 


change the P = NP question at all, because of the following classic observation. 
Proposition 4. Jf P = NP, then for every language L € NP (defined by a 


verifier M), there is a polynomial-time algorithm that actually finds a witness 
w € {0, 1} such that M (x, w) accepts, for all x € L. 


Proof. The idea is to learn the bits of an accepting witness w = w; -+- Wn) one by 
one, by asking a series of NP decision questions. For example: 


¢ Does there exist a w such that M (x, w) accepts and w; = 0? 
If the answer is “yes,” then next ask: 

¢ Does there exist a w such that M (x, w) accepts, w; = 0, and w2 = 0? 
Otherwise, next ask: 

¢ Does there exist a w such that M (x, w) accepts, w; = 1, and w2 = 0? 


Continue in this manner until all p (n) bits of w have been set. (This can also be 
seen as a binary search on the set of all 2? possible witnesses.) a 


Note that there are problems for which finding a solution is believed to be 
much harder than deciding whether one exists. A classic example, as it happens, 
is the problem of finding a Nash equilibrium of a matrix game. Here Nash’s 
theorem guarantees that an equilibrium always exists, but an important 2006 result 
of Daskalakis et al. [71] gave evidence that there is no polynomial-time algorithm 
to find an equilibrium.’ The upshot of Proposition 4 is just that search and decision 
are equivalent for the NP-complete problems. 


7Technically, Daskalakis et al. showed that the search problem of finding a Nash equilibrium is 
complete for a complexity class called PPAD. This could be loosely interpreted as saying that the 
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In practice, perhaps even more common than search problems are optimization 
problems, where we have some efficiently-computable cost function, say C : 
{0,1}" > {0,1,...,2?™}, and the goal is to find a solution x € {0,1}” that 
maximizes C (x). Fortunately, we can always reduce optimization problems to 
search and decision problems, by simply asking to find a solution x such that 
C(x) = K, and doing a binary search to find the largest K for which such an x 
still exists. So again, if P = NP then all NP optimization problems are solvable 
in polynomial time. On the other hand, it is important to remember that, while “is 
there an x such that C (x) > K?” is an NP question, “does max, C (x) = K?” and 
“does x* maximize C (x)?” are presumably not NP questions, because no single x 
is a witness to a yes-answer. 

More generally, the fact that decision, search, and optimization all hinge on the 


same P = NP question has meant that many people—including experts—freely 
abuse language by referring to search and optimization problems as “NP-complete.” 
Strictly they should call such problems NP-hard, while reserving “NP-complete” for 
suitable associated decision problems. 


2.2.2 The Twilight Zone: Between P and NP-complete 


We say a language L is NP-intermediate if L € NP, but L is neither in P nor 
NP-complete. One might hope, not only that P #4 NP, but that there would be 
a dichotomy, with all NP problems either in P or else NP-complete. However, a 
classic result by Ladner [135] rules that possibility out. 


Theorem 5 (Ladner [135]). Jf P 4 NP, then there exist NP-intermediate lan- 
guages. 


While Theorem 5 is theoretically important, the NP-intermediate problems that it 
yields are extremely artificial (requiring diagonalization to construct). On the other 
hand, as we’ll see, there are also problems of real-world importance—particularly 
in cryptography and number theory—that are believed to be NP-intermediate, and a 
proof of P # NP could leave the status of those problems open. (Of course, a proof 
of P = NP would mean there were no NP-intermediate problems, since every NP 
problem would then be both NP-complete and in P.) 


2.2.3. coNP and the Polynomial Hierarchy 


Let L = {0, 1}* \ L be the complement of L: that is, the set of strings not in L. Then 
the complexity class 


problem is “as close to NP-hard as it could possibly be, subject to Nash’s theorem showing why 
the decision version is trivial.” 
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coNP := {L: L€ NP} 


consists of the complements of all languages in NP. Note that this is not the same as 
NP, the set of all non-NP languages! Rather, L € coNP means that whenever x ¢ L, 
there’s a short proof of non-membership that can be efficiently verified. 

A natural question is whether NP is closed under complement: that is, whether 
NP = coNP. If P = NP, then certainly P = coNP, and hence NP = coNP also. 
On the other hand, we could imagine a world where NP = coNP even though 
P # NP. In that world, there would always be short proofs of unsatisfiability (or 
of the nonexistence of cliques, Hamilton cycles, etc.), but those proofs could be 
intractable to find. A generalization of the P ¢ NP conjecture says that this doesn’t 
happen: 


Conjecture 6. NP 4 coNP. 


A further generalization of P, NP, and coNP is the polynomial hierarchy PH. 
Defined by analogy with the arithmetic hierarchy in computability theory, PH is an 
infinite sequence of classes whose zeroth level equals P, and whose kth level (for 
k > 1) consists of all problems that are in P’ or NP“ or coNP*, for some language 
L in the (k — 1)st level. More succinctly, we write p24 = P, and 


AP = P21, DP =NPZ1, TIP = coNP™1 
for all k > 1.8 A more intuitive definition of PH is as the class of languages that are 
definable using a polynomial-time predicate with a constant number of alternating 
universal and existential quantifiers: for example, L € iwi if and only if there exists 
a polynomial-time machine M and polynomial p such that for all x, 


xeL — > Ywe {0, 1") az € (0, 1") M(x, w,z) accepts. 


NP is then the special case with just one existential quantifier, over witness strings w. 
If P = NP, then the entire PH “recursively unwinds” down to P: for example, 


=P = NPXP — NPP = NP =P. 


Moreover, one can show that if EP = TP or UP = pn for any k, then all the 
levels above the kthcome “crashing down” to UP = Hf. On the other hand, a 
collapse at the kth level isn’t known to imply a collapse at any lower level. Thus, 
we get an infinite sequence of stronger and stronger conjectures: first P #4 NP, 
then NP ¥ coNP, then = x ne. and so on. In the limit, we can conjecture the 
following: 


5In defining the kth level of the hierarchy, we could also have given oracles for TP_, rather than 
Df: it doesn’t matter. Note also that “an oracle for complexity class C” should be read as “an 
oracle for any C-complete language L.” 
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Conjecture 7. All the levels of PH are distinct—i.e., the infinite hierarchy is strict. 


This is a generalization of P # NP that many computer scientists believe, and 
that has many useful consequences that aren’t known to follow from P ¥ NP itself. 

It’s also interesting to consider NP N coNP, which is the class of languages that 
admit short, easily-checkable proofs for both membership and non-membership. 
Here is yet another strengthening of the P 4 NP conjecture: 


Conjecture 8. P 4 NPM coNP. 


Of course, if NP = coNP, then the P = NPNcoNP question becomes equivalent 


to the original P = NP question. But it’s conceivable that P = NP M coNP even if 
NP + coNP (Fig. 2). 


2.2.4 Factoring and Graph Isomorphism 


As an application of these concepts, let’s consider two languages that are suspected 
to be NP-intermediate. First, FAC—a language variant of the factoring problem— 
consists of all ordered pairs of positive integers (N, k) such that N has a nontrivial 
divisor at most k. Clearly a polynomial-time algorithm for FAC can be converted 
into a polynomial-time algorithm to output the prime factorization (by repeatedly 
doing binary search to peel off N’s smallest divisor), and vice versa. Second, 
GRAPHISO—that is, graph isomorphism—consists of all encodings of pairs of 
undirected graphs (G,H)}, such that G ~ H. It’s easy to see to see that FAC and 
GRAPHISO are both in NP. 

More interestingly, FAC is actually in NP M coNP. For one can prove that 
(N,k) €FAC by exhibiting the unique prime factorization of N, and showing that 
it only involves primes greater than k.” But this has the striking consequence that 
factoring cannot be NP-complete unless NP = coNP. The reason is the following. 


Fig. 2. The polynomial : 
hierarchy 


°This requires one nontrivial result, that every prime number has a succinct certificate—or in other 
words, that primality testing is in NP [180]. Since 2002, it is even known that primality testing is 
in P [14]. 
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Proposition 9. Jf any NPM coNP language is NP-complete, then NP = coNP, and 
hence PH collapses to NP. 


Proof. Suppose L € NP NM coNP. Then P“ C NPM coNP, since one can prove the 
validity of every answer to every query to the L-oracle (whether the answer is ‘yes’ 
or ‘no’). So if NP C P®, then NP C NPM coNP and hence NP = coNP. |_| 


GRAPHISO is not quite known to be in NP MN coNP. However, it has been proven 
to be in NPM coNP under a plausible assumption about pseudorandom generators 
[130]—and even with no assumptions, Boppana, Hastad, Zachos [49] proved the 
following. 


Theorem 10 ([49]). Jf GRAPHISO is NP-complete, then PH collapses to ae 


As this survey was being written, Babai [32] announced the following break- 
through result. 


Theorem 11 (Babai [32]). GRAPHISO is solvable in nP°'"8" time. 


Of course, this gives even more dramatic evidence that GRAPHISO is not NP- 
complete: if it was, then all NP problems would be solvable in n?°'Y'8” time as 
well. 


2.2.5 Space Complexity 


PSPACE is the class of languages L decidable by a Turing machine that uses a 
polynomial number of bits of space or memory, with no restriction on the number 
of time steps. Certainly P C PSPACE, since in ¢ time steps, a serial algorithm can 
access at most f memory cells. More generally, it is not hard to see that P C NP C 
PH C PSPACE, but none of these containments have been proved to be strict. 
The following conjecture—asserting that polynomial space is strictly stronger than 
polynomial time—is perhaps second only to P ¥ NP itself in notoriety. 


Conjecture 12. P # PSPACE. 


If P * NP, then certainly P 4 PSPACE as well, but the converse is not known. 

One can also define a nondeterministic variant of PSPACE, called NPSPACE. 
But a 1970 result called Savitch’s Theorem [197] shows that actually PSPACE = 
NPSPACE.!° The reasons for this are extremely specific to space, and do not seem 
to suggest any avenue to proving P = NP, the analogous statement for time. 


‘0A further surprising result from 1987, called the Immerman-Szelepcsényi Theorem [110, 218], 
says that NSPACE (f (n)) = coNSPACE (f (n)) for every “reasonable” memory bound f (n). (By 
contrast, Savitch’s Theorem produces a quadratic blowup when simulating nondeterministic space 
by deterministic space, and it remains open whether that blowup can be removed.) This further 
illustrates how space complexity behaves differently than we expect time complexity to behave. 


P= NP 21 
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Given an NP search problem, besides asking whether a solution exists, it is also 
natural to ask how many solutions there are. To capture this, in 1979 Valiant [226] 
defined the class #P (pronounced “‘sharp-P”’) of combinatorial counting problems. 
Formally, a function f : {0,1}* — N is in #P if and only if there is a polynomial- 
time Turing machine M, and a polynomial p, such that for all x € {0, 1}*, 


fW= I{w e {0, 1°") : M(x, w) accepts} | : 


Note that, unlike P, NP, and so on, #P is not a class of languages (i.e., decision 
problems). However, there are two ways we can compare #P to language classes. 
The first is by considering P*?: that is, P with a #P oracle. We then have NP C 
P#P Cc PSPACE, as well as the following highly non-obvious inclusion, called 
Toda’s Theorem. 


Theorem 13 (Toda [222]). PH < P#P. 


The second way is by considering a complexity class called PP (Probabilistic 
Polynomial-Time). PP can be defined as the class of languages L C {0,1}* for 
which there exist #P functions f and g such that for all inputs x € {0, 1}*, 


xeL<s > f(x) =>g). 


It is not hard to see that NP C PP C P#?. More interestingly, one can use binary 
search to show that PPP = P#P, so in that sense PP is “almost as strong as #P.” 

In practice, given any known NP-complete problem (3SAT, CLIQUE, SUB- 
SETSUM, etc.), the counting version of that problem (denoted #3SAT, #CLIQUE, 
#SUBSETSUM, etc.) will be #P-complete. Indeed, it is open whether there is any 
NP-complete problem that violates that rule. However, the converse is false: for 
example, the problem of deciding whether a graph has a perfect matching is in 
P, but Valiant [226] showed that counting the number of perfect matchings is #P- 
complete. 

The #P-complete problems are believed to be “genuinely much harder” than the 
NP-complete problems, in the sense that—in contrast to the situation with PH— 
even if P = NP we would still have no idea how to prove P = P*?. On the other 
hand, we do have the following nontrivial result. 


Theorem 14 (Stockmeyer [212]). Suppose P = NP. Then in polynomial time, we 
could approximate any #P function to within a factor of 1 + ¢, for any ¢ = 1/n°, 
2.2.7 Beyond Polynomial Resources 


Of course, one can consider many other time and space bounds besides polynomial. 
Before entering into this, I should offer a brief digression on the use of asymptotic 
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notation in theoretical computer science, since such notation will also be used later 
in the survey. 


¢ f(n) is O(g(n)) if there exist nonnegative constants A,B such that f(n) < 
Ag (n) + B for all n (i.e., g is an asymptotic upper bound on f). 

* f(n) is Q (g (n)) if g (n) is O(f (n)) Ge., g is an asymptotic lower bound on f). 

° f(n) is © (g (n)) if f (n) is O(g (n)) and g (n) is O(f (n)) (e., f and g grow at 
the same asymptotic rate). 

° f (n) is o(g (n)) if for all positive A, there exists a B such that f (n) < Ag (n) +B 
for all n (.e., g is a strict asymptotic upper bound on f). 


Now let TIME (f (7)) be the class of languages decidable in O(f (n)) time, 
let NTIME (f (m)) be the class decidable in nondeterministic O(f (n)) time 
that is, with a witness of size O(f (n)) that is verified in O(f (m)) time—and 
let SPACE (f (n)) be the class decidable in O (f (n)) space.'' We can then write 
P=), TIME (n‘) and NP = (J, NTIME (n*) and PSPACE = (), SPACE (n*). It 
is also interesting to study the exponential versions of these classes: 


= nk 
EXP = TIME (2 ) 


_ nk 
NEXP = U NTIME (2 ) 


EXPSPACE = |_)SPACE (2") 
k 


Note that by “exponential,” here we mean not just 2°”), but 2? for any polyno- 
mial p. 

Along with P C PSPACE, there is another fundamental relation between time 
and space classes: 


Proposition 15. PSPACE © EXP. 


Proof. Consider a deterministic machine whose state can be fully described by p (n) 
bits of information (e.g., the contents of a polynomial-size Turing machine tape, 
plus a few extra bits for the location and internal state of tape head). Clearly such a 
machine has at most 2?” possible states. Thus, after 2’ steps, either the machine 
has halted, or else it has entered an infinite loop and will never accept. So to decide 
whether the machine accepts, it suffices to simulate it for 2?” steps. B 


Unlike P or PSPACE, classes like TIME (n?), SPACE (n°), etc. can be sensitive to whether we 
are talking about Turing machines, RAM machines, or some other model of computation. But in 
any case, one can simply fix one of those models any time the classes are mentioned in this survey, 
and nothing will go wrong. 
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More generally, we get an infinite interleaved hierarchy of deterministic, nonde- 
terministic, and space classes: 


P CNP C PSPACE © EXP C NEXP C EXPSPACE C --- 


There is also a “higher-up” variant of the P 4 NP conjecture, which not surprisingly 
is also open: 


Conjecture 16. EXP # NEXP. 


‘ 
We can at least prove a close relationship between the P=NP and 


EXP = NEXP problems, via a trick called “padding” or “upward translation”: 
Proposition 17. Jf P = NP, then EXP = NEXP. 


Proof. Let L € NEXP, and let its verifier run in 2?” time for some polynomial p. 
Then consider the language 


jr le He u 


which consists of the inputs in L, but “padded out with an exponential number of 
trailing zeroes.” Then L’ € NP, since verifying that x € {0, 1}” is in L takes 2?” 
time, which is linear in n + 2? (the length of x07"), So by assumption, L’ € P 
as well. But this means that L € EXP, since given x € {0,1}", we can simply 
pad x out with 2?” trailing zeroes ourselves, then run the algorithm that takes time 
polynomial in n + 2?” BH 


For the same reason, if P = PSPACE, then EXP = EXPSPACE. On the other 
hand, padding only works in one direction: as far as anyone knows today, we could 
have P 4 NP even if EXP = NEXP. 


5 

To summarize, P = NP is just the tip of an iceberg; there seems to be an 
extremely rich structure both below and above the NP-complete problems. Until 
we can prove P # NP, however, most of that structure will remain conjectural. 


3 Beliefs About P = NP 


Just as Hilbert’s question turned out to have a negative answer, so too in this 
case, most computer scientists conjecture that P ~ NP: that there exist rapidly 
checkable problems that are not rapidly solvable, and for which brute-force search 
is close to the best that one can do. This is not a unanimous opinion. At least 
one famous computer scientist, Donald Knuth [131], has professed a belief that 
P = NP, while another, Richard Lipton [148], professes agnosticism. Also, in a 
poll of mathematicians and theoretical computer scientists conducted by William 
Gasarch [87] in 2002, there were 61 respondents who said P # NP, but also 9 


24 S. Aaronson 


who said P = NP. Admittedly, it can be hard to tell whether declarations that 
P = NP are meant seriously, or are merely attempts to be contrarian. However, 
we can surely agree with Knuth and Lipton that we are far from understanding the 
limits of efficient computation, and that there are further surprises in store. 

In this section, I’d like to explain why, despite our limited understanding, many 
of us feel roughly as confident about P 4 NP as we do about (say) the Riemann 
Hypothesis, or other conjectures in math—not to mention empirical sciences—that 
most experts believe without proof.!* 

The first point is that, when we ask whether P = NP, we are not asking whether 
heuristic optimization methods (such as SAT-solvers) can sometimes do well in 
practice; or whether there are sometimes clever ways to avoid exponential search. If 
you believe, for example, that there is any cryptographic one-way function—that is, 
any transformation of inputs x — f (x) that is easy to compute but hard to invert— 
then that is enough for P 4 NP. Such an f need not have any “nice” mathematical 
structure (like the discrete logarithm function); it could simply be, say, the evolution 
function of some arbitrary cellular automaton. 


It is sometimes claimed that, when we consider P =N P, there is a “symmetry of 
ignorance’: yes, we have no idea how to solve NP-complete problems in polynomial 
time, but we also have no idea how to prove that impossible, and therefore anyone is 
free to believe whatever they like. In my view, however, what breaks the symmetry 
is the immense, well-known difficulty of proving lower bounds. Simply put: even 
if we suppose P # NP, I don’t believe there’s any great mystery about why a 
proof has remained elusive. A rigorous impossibility proof is often a tall order, and 
many times in history—e.g., with Fermat’s Last Theorem, the Kepler Conjecture, 
or the problem of squaring the circle—such a proof was requested centuries before 
mathematical understanding had advanced to the point where it became a realistic 
possibility! And as we’ll see in Sects. 4 and 6, today we know something about the 
difficulty of proving even “baby” versions of P # NP; about the barriers that have 
been overcome and the others that remain to be. 

By contrast, if P = NP, then there is, at least, a puzzle about why the whole 
software industry, over half a century, has failed to uncover any promising leads for, 
say, a fast algorithm to invert arbitrary one-way functions (just the algorithm itself, 
not necessarily a proof that it works). The puzzle is heightened when we realize 
that, in many real-world cases—such as linear programming, primality testing, and 
network routing—fast methods to handle a problem in practice did come decades 
before a full theoretical understanding of why the methods worked. 

Another reason to believe P 4 NP comes from the hierarchy theorems, which 
we’ll meet in Sect. 6.1. Roughly speaking, these theorems imply that “most” pairs 
of complexity classes are unequal; the problem, in most cases, is simply that we 


'2] like to joke that, if computer scientists had been physicists, we’d simply have declared P # NP 
to be an observed law of Nature, analogous to the laws of thermodynamics. A Nobel Prize would 
even be given for the discovery of that law. (And in the unlikely event that someone later proved 
P = NP, a second Nobel Prize would be awarded for the law’s overthrow.) 
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can’t prove this for specific pairs! For example, in the chain of complexity classes 
P C NP C PSPACE C EXP, we know that P 4 EXP, which implies that at least 
one of P # NP, NP 4 PSPACE, and PSPACE # EXP must hold. So we might say: 
given the provable reality of a rich lattice of unequal complexity classes, one needs 
to offer a special argument if one thinks two classes collapse, but not necessarily if 
one thinks they’re different. 

To my mind, however, the strongest argument for P 4 NP involves the thousands 
of problems that have been shown to be NP-complete, and the thousands of other 
problems that have been shown to be in P. If just one of these problems had 
turned out to be both NP-complete and in P, that would have immediately implied 
P = NP. Thus, one could argue, the P # NP hypothesis has had thousands 
of opportunities to be “falsified by observation.” Yet somehow, in every case, the 
NP-completeness reductions and the polynomial-time algorithms conspicuously 
avoid meeting each other—a phenomenon that I once described as the “invisible 
fence” [7]. 

This phenomenon becomes particularly striking when we consider approxima- 
tion algorithms for NP-hard problems, which return not necessarily an optimal 
solution but a solution within some factor of optimal. To illustrate, there is a simple 
polynomial-time algorithm that, given a 3SAT instance ¢, finds an assignment that 
satisfies at least a 7/8 fraction of the clauses.'* Conversely, in 1997 Johan Hastad 
[105] proved the following striking result. 


Theorem 18 (Hastad [105]). Suppose there is a polynomial-time algorithm that, 
given as input a Satisfiable 3SAT instance ~, outputs an assignment that satisfies at 
least a 7/8 + & fraction of the clauses, where ¢ > 0 is any constant. Then P = NP. 


Theorem 18 is one (strong) version of the PCP Theorem [29, 30], which is 
considered one of the crowning achievements of theoretical computer science. The 
PCP Theorem yields many other examples of “sharp NP-completeness thresholds,” 
where as we numerically adjust the required solution quality, an optimization 
problem undergoes a sudden “phase transition” from being in P to being NP- 
complete. Other times there is a gap between the region of parameter space known 
to be in P and the region known to be NP-complete. One of the major aims of 
contemporary research is to close those gaps, for example by proving the so-called 
Unique Games Conjecture [127]. 

We see a similar “invisible fence” if we shift our attention from approximation 
algorithms to Leslie Valiant’s program of “accidental algorithms” [227]. The latter 
are polynomial-time algorithms, often for planar graph problems, that exist for 


Strictly speaking, this is for the variant of 3SAT in which every clause must have exactly three 
literals, rather than at most three. 

Also note that, if we allow the use of randomness, then we can satisfy a 7/8 fraction of the 
clauses in expectation by just setting each of the n variables uniformly at random! This is because 
a clause with three literals has 2 — 1 = 7 ways to be satisfied, and only one way to be unsatisfied. 
A deterministic polynomial-time algorithm that’s guaranteed to satisfy at least 7/8 of the clauses 
requires only a little more work. 
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certain parameter values but not for others, for reasons that are utterly opaque 
if one doesn’t understand the strange cancellations that the algorithms exploit. A 
prototypical result is the following: 


Theorem 19 (Valiant [227]). Let PLANAR3SAT be a special case of 3SAT in which 
the bipartite graph of clauses and variables is a planar graph. Now consider the 
following problem: given an instance of PLANAR3SAT which is monotone (i.e., 
has no negations), and in which each variable occurs twice, count the number of 
Satisfying assignments mod k. This problem is in P for k = 7, but is NP-hard under 
randomized reductions for k = 2.'* 


Needless to say (because otherwise you would have heard!), in not one of these 
examples have the “P region” and the “NP-complete region” of parameter space 
been discovered to overlap. For example, in Theorem 19, the NP-hardness proof 
just happens to fail if we ask about the number of solutions mod 7, the very case for 
which an algorithm is known. If P = NP then this is, at the least, an unexplained 
coincidence. If P ~ NP, on the other hand, then it makes perfect sense. 


3.1 Independent of Set Theory? 


Since the 1970s, there has been speculation that P A NP might be independent 
(that is, neither provable or disprovable) from the standard axiom systems for 
mathematics, such as Zermelo-Fraenkel set theory. To be clear, this would mean 
that either 


(1) P # NP, but that fact could never be proved (at least not in our usual formal 
systems), or else 

(2) a polynomial-time algorithm for NP-complete problems does exist, but it can 
never be proven to work, or to halt in polynomial time. 


Because P ¥ NP is a purely arithmetical statement (a I12-sentence), it can’t 
simply be excised from mathematics, as some formalists would do with (say) the 
Continuum Hypothesis or the Axiom of Choice. A polynomial-time algorithm for 
3Sat either exists or it doesn’t! But that doesn’t imply that we can prove which. 


In 2003, I wrote a survey article [1] about whether P = NP is formally 
independent, which somehow never got around to offering any opinion about the 
likelihood of that eventuality! So for the record: I regard the independence of 
P = NP as a farfetched possibility, as I do for the Riemann hypothesis, Goldbach’s 
conjecture, and other unsolved problems of “ordinary” mathematics. At the least, 


5 
Id say that the independence of P = NP has the status right now of a “free-floating 
speculation” with little or no support from past mathematical experience. 


'4Indeed, a natural conjecture would be that the problem is NP-hard under randomized reductions 
for all k # 7, but this remains open (Valiant, personal communication). 
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There have been celebrated independence results over the past century, but as far 
as I know they all fall into four classes: 


(1) Independence of statements that are themselves about formal systems: for 
example, that assert their own unprovability in ZF set theory, or ZF’s consis- 
tency. This is the class produced by Gédel’s incompleteness theorems. 

(2) Independence of statements in transfinite set theory, such as the Axiom of 
Choice (AC) and the Continuum Hypothesis (CH). Unlike “ordinary” mathe- 
matical statements—P #4 NP, the Riemann hypothesis, etc.—the set-theoretic 
ones can’t be rephrased in the language of elementary arithmetic; only questions 
about their provability from various axiom systems are arithmetical. For that 
reason, one can question whether AC, CH, and so on need to have definite truth- 
values at all, independent of the axiom system. In any case, the independence 
of set-theoretic principles seems different in kind, and less “threatening,” than 
the independence of arithmetical statements.'> 

(3) Independence from “weak” systems, which don’t encompass all accepted 
mathematical reasoning. Goodstein’s Theorem [91], and the non-losability of 
the Kirby-Paris hydra game [129] are two examples of interesting arithmetical 
statements that can be proved using small amounts of set theory (or ordinal 
induction), but not within Peano arithmetic. 

(4) Independence from ZF of strange combinatorial statements, which (alas) would 
never have been studied if not for their independence. Harvey Friedman [84] 
has produced striking examples of such statements. 


Of course, it’s possible that P # NP is unprovable, but that that fact itself will 
forever elude proof: indeed, maybe the question of the independence of P 4 NP is 
itself independent of set theory, and so on ad infinitum! But one can at least say 
that, if P 4 NP (or for that matter, the Riemann hypothesis, Goldbach’s conjecture, 
etc.) were proven independent of ZF, it would be an unprecedented development: 
probably history’s first example of an independence result that didn’t fall into one 
of the four classes above.'® 

The proof of independence would also have to be unlike any known inde- 
pendence proof. Ben-David and Halevi [39] noticed that the techniques used to 
prove statements such as Goodstein’s Theorem independent of Peano arithmetic, 
actually prove independence from the stronger theory PA+I1;: that is, Peano 
arithmetic plus the set of all true arithmetical I1,-sentences (sentences with a single 


‘Note also that, by the Shoenfield absoluteness theorem [202], one’s beliefs about the Axiom of 
Choice, the Continuum Hypothesis, or other statements proven independent of ZF via forcing can 
have no effect on the provability of arithmetical statements such as P 4 NP. 

‘lf a T1y-sentence like the Goldbach Conjecture or the Riemann Hypothesis were known to be 
independent of ZF, then it would also be known to be true, since any counterexample would have 
a trivial finite proof! On the other hand, we could also imagine, say, the Goldbach Conjecture 
being proven equivalent to the consistency of ZF, in which case we could say only that either ZF 
is consistent and Goldbach is true but ZF doesn’t prove either, or else ZF proves anything. In any 
case, none of this directly applies to P 4 NP, which is a I12-sentence. 
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universal quantifier and no existential quantifiers). However, if P #4 NP could be 
proven independent of PA +I1,, that would mean that no I1,-sentence implying 
P #~ NP could hold. And thus, for example, NP-complete problems would have 
to be solvable in n time for all computable functions f, no matter how slowly 
growing. We would “almost” have P = NP. 

As Sect.6 will discuss, there are various formal barriers—including the rel- 
ativization, algebrization, and natural proofs barriers—that explain why certain 
existing techniques cannot be powerful enough to prove P ~ NP. These barriers can 
be interpreted, literally, as proofs that P 4 NP is formally unprovable from certain 
sets of axioms: namely, axioms that capture, or at least come close to capturing, 
the power of the techniques in question (relativizing, algebrizing, or naturalizing 
techniques) [28, 112, 190]. In all these cases, however, the axiom sets are known 
not to capture all techniques in complexity theory: there are existing results that go 
beyond these axiom sets (albeit, only a few that go beyond all of them at once). Thus, 
these barriers indicate gaps in our current techniques, rather than in the foundations 
of mathematics. 


4 Why Is Proving P # NP Difficult? 


Let’s suppose that P 4 NP. Then given the disarming simplicity of the statement, 
why is proving it so difficult? As mentioned above, complexity theorists have 
identified three technical barriers, called relativization [34], natural proofs [191], 
and algebrization [10], that any proof of P #4 NP will need to overcome. They’ve 
also shown that it’s possible to surmount each of these barriers, though there are 
few results that surmount all of them simultaneously. The barriers will be discussed 
alongside progress toward proving P # NP in Sect. 6. 

However, one can also say something more conceptual, and possibly more 
illuminating, about the meta-question of why it’s so hard to prove hardness. In my 
view, the central reason why proving P # NP is hard is simply that, in case after 
case, there are amazingly clever ways to avoid brute-force search, and the diversity 
of those ways rivals the diversity of mathematics itself. And even if, as I said in 
Sect. 3, there seems to be an “invisible electric fence” separating the NP-complete 
problems from the slight variants of those problems that are in P—still, almost any 
argument anyone can imagine for why the NP-complete problems are hard would, 
if it worked, also apply to the variants in P. 

To illustrate, we saw in Sect. 2.1 that 3SAT is NP-complete. We also saw that 
2SAT, which is like 3SAT except with two variables per clause rather than three, is 
in P: indeed, 2SAT is solvable in linear time. Other variants of satisfiability that are 
in P include HORNSAT (where each clause is an OR of arbitrary many non-negated 
variables and at most one negated variable), and XORSAT (where each clause is a 
linear equation mod 2, such as x2 ® x7 ® x9 = 1 (mod 2)). 

Likewise, even though it’s NP-complete to decide whether a given graph is 
3-colorable, one can decide in linear time whether a graph is 2-colorable. Also, 
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even though SUBSETSUM is NP-complete, one can easily decide whether there’s a 
subset of aj,...,a, summing to b in time that’s nearly linear in a; + «++ + ax. In 
other words, if each a; is required to be encoded in “unary” notation (that is, as a list 
of a; ones) rather than in binary, then SUBSETSUM is in P. 

As a more interesting example, finding the maximum clique in a graph is NP- 
complete, as are finding the minimum vertex cover, the chromatic number, and so 
on. Yet in the 1960s, Edmonds [74] famously showed that the maximum matching— 
that is, the largest set of edges no two of which share a vertex—can be found in P. 
To a casual observer, matching doesn’t look terribly different from the other graph 
optimization problems, but it is different. 

Or consider linear, semidefinite, and convex programming. These techniques 
yield hundreds of optimization problems that “seem at first like they should be 
NP-complete,” yet are solvable in P. A few examples are finding maximum flows, 
finding equilibria of two-player zero-sum games, training linear classifiers, and 
optimizing over quantum states and unitary transformations.!” 

We can also give examples of “shocking” algorithms for problems that are clearly 
in P. Most famously, the problem of multiplying two nxn matrices, C = AB, seems 
like it should “obviously” require ~ n? steps: ~ n steps for each of the n? entries of 
the product matrix C. But in 1968, Strassen [214] discovered an algorithm that takes 
only O (n'o8 z) steps. There has since been a long sequence of further improvements, 
culminating in the O (n?3"°) algorithm by Coppersmith and Winograd [70], and its 
recent improvements to O (n?3”*) by Stothers [213] and to O (n*77) by Vassilevska 
Williams [230]. Thus, letting w be the matrix multiplication exponent (1.e., the least 


77 won’t have much to say about linear or semidefinite programming in this survey, so perhaps 
this is as good a place as any to mention that today, we know a great deal about the impossibility 
of solving NP-complete problems in polynomial time by formulating them as “natural” linear 
programs. This story starts in 1987, with a preprint by Swart [217] that claimed to prove P = 
NP by reducing the Traveling Salesperson Problem to a linear program with n® variables and 
constraints. Swart’s preprint inspired a landmark paper by Yannakakis [244] (making it possibly 
the most productive failed P = NP proof in history!), in which Yannakakis showed that there 
is no “symmetric” linear program with n?”) variables and constraints that has the “Traveling 
Salesperson Polytope” as its projection onto a subset of the variables. This ruled out Swart’s 
approach. Yannakakis also showed that the polytope corresponding to the maximum matching 
problem has no symmetric LP of subexponential size, but the polytope for the minimum spanning 
tree problem does have a polynomial-size LP. In general, expressibility by such an LP is sufficient 
for a problem to be in P, but not necessary. 

Later, in 2012, Fiorini et al. [76] substantially improved Yannakakis’s result, getting rid of 
the symmetry requirement. There have since been other major results in this direction: in 2014, 
Rothvof8[194] showed that the perfect matching polytope requires exponentially-large LPs (again 
with no symmetry requirement), while in 2015, Lee, Raghavendra, and Steurer [140] extended 
many of these lower bounds from linear to semidefinite programs. 

Collectively, these results rule out one “natural” approach to proving P = NP: namely, to start 
from famous NP-hard optimization problems like TSP, and then find a polynomial-size linear or 
semidefinite program that projects onto the polytope whose extreme points are the valid solutions. 
Of course, we can’t yet rule out the possibility that linear or semidefinite programs could help 
prove P = NP in some more indirect way (or via some NP-hard problem other than the specific 
ones that were studied); ruling that out seems essentially tantamount to proving P # NP itself. 
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@ such that n x n matrices can be multiplied in n®*° time), all we know today 
is that w € [2, 2.373]. Some computer scientists conjecture that w = 2; but in any 
case, just like with attempts to prove P # NP, an obvious obstruction to proving 
@ > 2 is that the proof had better not yield w = 3, or even a “natural-looking” 
bound like w > 2.5. 

The breadth of clever polynomial-time algorithms might seem like a trite 
observation, incommensurate with the challenge of explaining why it’s so hard to 
prove P 4 NP. Yet we have evidence to the contrary. Over the decades, there have 
been hundreds of flawed ‘proofs’ announced for P 4 NP. The announcement that 
received the most attention, including coverage in The New York Times and other 
major media outlets, was that of Deolalikar [73] in 2010. But in every such case that 
I’m aware of, the proof could ultimately be rejected on the ground that, if it worked, 
then it would also yield superpolynomial lower bounds for problems known to be 
in P. 

With some flawed P 4 NP proofs, this is easy to see: for example, perhaps 
the author proves that 3SAT must take exponential time, by some argument that’s 
fearsome in technical details, but that ultimately boils down to “there are 2” possible 
assignments to the variables, and clearly any algorithm must spend at least one step 
rejecting each of them.” A general-purpose refutation of such arguments is simply 
that, if they worked, then they’d work equally well for 2SAT. Alternatively, one 
could point out that, as we’ll see in Sect. 5.1, it’s known how to solve 3SAT in 1.3” 
time. Soa P ¥ NP proof had better not imply a Q (2”) lower bound for 3SAT. 

In the case of Deolalikar’s P + NP attempt [73], the details were more 
complicated, but the bottom line ended up being similar. Deolalikar appealed to 
certain statistical properties of the set of satisfying assignments of a random 3SAT 
instance. The claim was that, for reasons having to do with logical definability, those 
statistical properties precluded 3SAT from having a polynomial-time algorithm. 
During an intense online discussion, however, skeptics pointed out that random 
XORSAT—which we previously mentioned as a satisfiability variant in P—gives 
rise to solution sets indistinguishable from those of random 3SAT, with respect to 
the properties Deolalikar was using (see for example [229]). This implied that there 
must be one or more bugs in the proof, though it still left the task of finding them 
(which was also done). 

None of this means that proving P 4 NP is impossible. A priori, it might also 
have been hard to imagine a proof of the unsolvability of the halting problem, 
but of course we know that such a proof exists. As we’ll see in Sect.6.1, a 
central difference between the two cases is that methods from logic—namely, 
diagonalization and self-reference—worked to prove the unsolvability of the halting 
problem, but there’s a precise sense in which these tricks can’t work (at least not 
by themselves) to prove P # NP. A related difference comes from the quantitative 
character of P #4 NP: somehow, any proof will need to explain why (say) a 1.00001” 
or 2V” algorithm for 3SAT is impossible, even though a 1.3” algorithm exists. In 
some sense, this need to make fine quantitative distinctions—to say that, yes, brute- 
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force search can be beaten, but only by this much for this problem and by that much 
for that one—puts a lower bound on the sophistication of any P 4 NP proof. 


5 Strengthenings of the P 4 NP Conjecture 


I'll now survey various strengthenings of the P + NP conjecture, which are 
often needed for applications to cryptography, quantum computing, fine-grained 
complexity, and elsewhere. Some of these strengthenings will play a role when, 
in Sect. 6, we discuss the main approaches to proving P # NP that have been tried. 


5.1 Different Running Times 


There has been a great deal of progress on beating brute-force search for many NP- 
complete problems, even if the resulting algorithms still take exponential time. For 
example, the following was shown by Schéning. 


Theorem 20 (Schéning [198]). There is a randomized algorithm that solves 3SAT 
in O(1.3") time. 


For many NP-complete problems like HAMILTONCYCLE, for which the obvious 
brute-force algorithm takes ~ n! time, it is also possible to reduce the running time 
to O(2”), or sometimes even to O(c") for some c < 2, through clever dynamic 
programming. 

How far can these algorithms be pushed? For example, is it possible that 
3SAT could be solvable in 20(vn) time, as various NP-intermediate problems like 
FACTORING are known to be? An important conjecture called the Exponential Time 
Hypothesis, or ETH, asserts that the answer is no: 


Conjecture 21 (Exponential Time Hypothesis). Any deterministic algorithm for 
3SAT takes Q (c") steps, for some constant c > 1. 


The ETH is an ambitious strengthening of P 4 NP, one that has found many 
applications in recent years. Often, for example, assuming the ETH, it can be 
shown that some particular polynomial-time or quasipolynomial-time algorithm 
is optimal (i.e., that its exponent can’t be improved), whereas nothing similar is 
known assuming only P # NP. One example is the problem of computing the 
edit distance between two strings, or the minimum number of insertions, deletions, 
and replacements needed to transform one string to the other. Here a quadratic-time 
algorithm has long been known, while in a recent breakthrough, Backurs and Indyk 
[33] proved that algorithm to be essentially optimal assuming the ETH. A second 
example is the problem of approximating the value of a two-prover “free game”: 
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here Aaronson et al. [9] gave an n°"°S” algorithm, and also proved that algorithm 
essentially optimal assuming the ETH. 


5.2 Nonuniform Algorithms and Circuits 


P = NP asks whether there is a single algorithm that, for each input size n, solves an 
NP-complete problem like 3S AT in time polynomial inn. But one could also allow a 
different algorithm for each input size: for example, imagine a clever approach that 
yielded a speedup for n = 1000 but was swamped by constant factors for n = 100, 
another approach that worked for n = 10,000 but not n = 1000, and so on. To 
formalize this notion, let P/poly be the class of languages L for which there exists 
a polynomial-time Turing machine M, as well as an infinite set of “advice strings” 
a, @2,..., where a, is p (n) bits long for some polynomial p, such that for all n and 
all x € {0, 1}", we have 


M (x, dy) accepts <=> x € L. 


An equivalent way to define P/poly is as the class of languages recognized by a 
family of polynomial-size circuits: that is, networks of Boolean logic gates (such 
as AND, OR, NOT), with the input bits x;,...,x, at the bottom and with a bit 
determining whether x € L at the top, where the network can be different for each 
input length n.'® The size of such a circuit is simply the number of logic gates in it. 
P/poly is a nonuniform generalization of P: certainly P C P/poly, but there is no 
containment in the other direction.!° 
Now, the nonuniform version of the P 4 NP conjecture is the following. 


Conjecture 22. NP ¢ P/poly. 


If P = NP, then certainly NP C P/poly, but the converse need not hold. About 
the closest we have to a converse is the Karp-Lipton Theorem [123]: 


Theorem 23. If NP Cc P/poly, then PH collapses to £5. 


Proof. Consider a complete problem for IT5: say, “for all x € {0,1}” does 
there exist a y € {0, 1}” such that A (x, y) accepts?”, for some polynomial p and 
polynomial-time algorithm A. Assuming NP Cc P/poly, we can solve that problem 
in pI as follows: 


'8Despite the term “circuit,” which comes from electrical engineering, circuits in theoretical 
computer science are always free of cycles; they proceed from the inputs to the output via layers 
of logic gates. 

'°This is a rare instance where the non-containment can actually be proved: for example, any unary 
language (i.e., language of the form {0" : n € S}) is clearly in P/poly, but there is an uncountable 
infinity of such languages, so almost all of them cannot be in P. 
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¢ “Does there exist a circuit C such that for all x, the algorithm A (x, C (x)) 
accepts?” 


For if NP Cc P/poly and VxdyA (x,y) is true, then clearly there exists a 
polynomial-size circuit C that takes x as input, and outputs a y such that A (x, y) 
accepts. So we can simply use the existential quantifier in our =P algorithm to guess 
a description of that circuit. 

We conclude that, if NP C P/poly, then Be Cc p34 (and by symmetry, = G 
Te ). But this is known to cause a collapse of the entire polynomial hierarchy to 
S. a 


In summary, while most complexity theorists conjecture that NP ¢ P/poly, as 
far as we know it is a stronger conjecture than P 4 NP. Indeed, it is even plausible 
that future techniques could prove P 4 NP without proving NP ¢ P/poly: for 
example, as we will discuss in Sect. 6.1, we can currently prove P # EXP, but 
cannot currently prove EXP ¢ P/poly, or even NEXP ¢ P/poly. Despite this, as 
we will see in Sect. 6, most of the techniques that have been explored for proving 
P # NP, would actually yield the stronger result NP ¢ P/poly if they worked. For 


that reason, P/poly plays a central role in work on the P = NP question. 

There is one other aspect of circuit complexity that will play a role later in this 
survey, and that is depth. The depth of a circuit simply means the length of the 
longest path from an input bit to the output bit—or, if we think of the logic gates 
as organized into layers, then the number of layers. There is a subclass of P/poly 
called NC’ (the NC stands for “Nick’s Class,” after Nick Pippenger) which consists 
of all languages that are decided by a family of circuits that have polynomial size 
and also depth O (logn).”° One can also think of NC' as the class of problems 
solvable in logarithmic time (nonuniformly) using a polynomial number of parallel 
processors. It is conjectured that P ¢ NC! (that is, not all efficient algorithms can 
be parallelized), but alas, even showing that NEXP ¢ NC! remains open at present. 

Another way to define NC' is as the class of languages decidable by a family of 
polynomial-size Boolean formulas. In theoretical computer science, a formula just 
means a circuit where every gate has fanout | (that is, where a gate cannot have its 
output fed as input to multiple other gates). To see the equivalence: in one direction, 
by replicating subcircuits wherever necessary, clearly any circuit of depth d and size 
s can be “unraveled” into a formula of depth d and size at most 2¢s, which is still 
polynomial in n if d = O(logn) and s = n°), In the other direction, there is an 
extremely useful fact proved by Brent [52], called “depth reduction.” 


Proposition 24 (Brent [52]). Given any Boolean formula of size S, there is an 
equivalent formula of size S°) and depth O (log S).”! 


0Note that, if each logic gate depends on at most 2 inputs, then log, n is the smallest depth that 
allows the output to depend on all n input bits. 

21Bshouty, Cleve, and Eberly [53] showed that the size of the depth-reduced formula can even be 
taken to be O (s lye) for any constant ¢ > 0. 


34 S. Aaronson 


Because of Proposition 24, the minimum depth D of any formula for a Boolean 
function f is simply © (log S), where S is the minimum size of any formula for /. 
For circuits, by contrast, size and depth are two independent variables, which might 
in general be related only by D < S < 2?. 


5.3 Average-Case Complexity 


If P * NP, that means that there are NP problems for which no Turing machine 
succeeds at solving all instances in polynomial time. But often, especially in 
cryptography, we need more than that. It would be laughable to advertise a 
cryptosystem on the grounds that there exist messages that are hard to decode! Thus, 
it is natural to ask whether there are NP problems that are hard “in the average case” 
or “on random instances,” rather than merely in the worst case. More pointedly, does 
the existence of such problems follow from P # NP, or is it a different, stronger 
assumption? 

The first step is to clarify what we mean by a “random instance.” For some 
NP-complete problems, it makes sense to ask about a uniform random instance: 
for example, we can consider 3SAT with n variables and m = an uniformly- 
random clauses (for some constant a), or 3COLORING on an Erdés-Rényi random 
graph. In those cases, the difficulty tends to vary wildly with the problem and 
the precise distribution. With 3SAT, for example, if the clause/variable ratio @ is 
too small, then random instances are trivially satisfiable, while if a is too large, 
then they are trivially unsatisfiable. But there is a “sweet spot,’ a ~ 4.25, where 
random 3SAT undergoes a phase transition from satisfiable to unsatisfiable, and 
where the difficulty seems to blow up accordingly. Even at the threshold, however, 
random 3SAT might still be much easier than worst-case 3SAT: the breakthrough 
“survey propagation algorithm” [50] can solve random 3SAT quickly, even for a 
extremely close to the threshold.’* More generally, there has been a great deal 
of work on understanding particular distributions over instances, often using tools 
from statistical physics (for an accessible introduction, see for example Moore and 
Mertens [155]). Unfortunately, there are almost no known reductions among these 
sorts of distributional problems, which would let us say that if one of them is hard 
then so is another. The reason is that almost any imaginable reduction from problem 
A to problem B will map a random instance of A to an extremely special, non- 
random instance of B. 

This means that, if we want to pick random instances of NP-complete problems 
and be confident they are hard, then we might need carefully-tailored distributions. 
Levin [142], and Li and Vitanyi [143], observed that there exists a “universal 
distribution” D with the remarkable property that any algorithm that fails on any 
instance, will also fail with high probability with respect to instances drawn from D. 


>2But making matters more complicated still, survey propagation fails badly on random 4Sar. 
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Briefly, one constructs D by giving each string x € {0, 1}* a probability proportional 
to 2-*“, where K (x) is the Kolmogorov complexity of x: that is, the number of bits 
in the shortest computer program whose output is x. One then argues that, given 
any algorithm A, one can design a short computer program that brute-force searches 
for the first instances on which A fails—and for that reason, if there are any such 
instances, then D will assign them a high probability. 

In this construction, the catch is that there is no feasible way actually to sample 
instances from the magical distribution D. Thus, given a family of distributions 
D = {D,},, where D,, is over {0, 1}", call D efficiently samplable if there exists 
a Turing machine that takes as input a positive integer n and a uniformly random 
string r € {0,1}? (for some polynomial p), and that outputs a sample from D,, 
in time polynomial in n.”*> Then the real question, one might say, is whether any 
NP-complete problems are hard on average with respect to efficiently samplable 
distributions. More formally, does the following conjecture hold? 


Conjecture 25 (NP Hard on Average). There exists a language L € NP, as well 
as an efficiently samplable family of distributions D = {D,},, such that for all 
polynomial-time algorithms A, there exists an n such that 


Pr [A (x) = L(x)] < 0.51. 


Here L(x) € {0, 1} denotes the characteristic function of L. 


It is a longstanding open problem whether P # NP implies Conjecture 25. 
There are NP-intermediate problems—one famous example being the discrete 
logarithm problem—that are known to have the remarkable property of worst- 
case/average-case equivalence. That is, any polynomial-time algorithm for these 
problems that works on (say) 10% of instances implies a polynomial-time algorithm 
for all instances; and conversely, if the problem is hard at all then it is hard on 
average. However, despite decades of work, no one has been able to show worst- 
case/average-case equivalence for any NP-complete problem (with respect to any 
efficiently samplable distribution), and there are known obstacles to such a result. 
For details, see for example the survey by Bogdanov and Trevisan [47]. 


5.3.1 Cryptography and One-Way Functions 


One might hope that, even if we cannot base secure cryptography solely on the 
assumption that P # NP, at least we could base it on Conjecture 25. But there is 
one more obstacle. In cryptography, we do not merely need NP problems for which 
it is easy to generate hard instances: rather, we need NP problems for which it is 
easy to generate hard instances, along with secret solutions to those instances. This 


23We could also allow sampling from some distribution close to D,, but we will ignore that 
complication here. 
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motivates the definition of a one-way function (OWF), perhaps the central concept 

of modern cryptography. Let f = {f,},, be a family of functions, with f, : {0, 1}" > 

£0, 1}? for some polynomial p. Then we call f a one-way function family if 

(1) f, is computable in time polynomial in 7, but 

(2) f, is hard to invert: that is, for all polynomial-time algorithms A and all 
polynomials q, 


1 


ee fn (A Gn (x) = fn (X)] < ev 


We then make the following conjecture. 
Conjecture 26. There exists a one-way function family. 


Conjecture 26 is stronger than Conjecture 25, which in turn is stronger than P # 
NP. Indeed, it is not hard to show the following. 


Proposition 27. Conjecture 26 holds if and only if there exists a fast way to gener- 
ate hard random 3SAT instances with “planted solutions”: that is, an efficiently 
samplable family of distributions D = {D,},, over (~,x) pairs, where p is a 
satisfiable 3SAT instance and x is a satisfying assignment to », such that for all 
polynomial-time algorithms A and all polynomials q, 


1 
q(n) 


Pr [A (¢) finds a satisfying assignment to gy] < 
g~Dn 


Proof. Given a one-way function family f, we can generate a hard random 3SAT 
instance with a planted solution by choosing x € {0,1}” uniformly at random, 
computing f,, (x), and then using the Cook-Levin Theorem (Theorem 2) to construct 
a 3SAT instance that encodes the problem of finding a preimage of f,, (x). Conversely, 
given a polynomial-time algorithm that takes as input a positive integer n and a 
random string r € {0, 1}” ®) (for some polynomial p), and that outputs a hard 3SAT 


instance y, together with a planted solution x, to y,, the function f, (r) := @, will 
necessarily be one-way (since inverting f,, would let us find a satisfying assignment 
to @,). | 


Conjecture 26 turns out to suffice for building most of the ingredients of private- 
key cryptography, notably including pseudorandom generators [106] and pseudo- 
random functions [88]. Furthermore, while Conjecture 26 is formally stronger than 
P 4 NP, Proposition 27 suggests that the two conjectures are conceptually similar: 
“all we are asking for” is a hard NP problem, together with a fast way to generate 
hard solved instances of it. 

This contrasts with the situation for public-key cryptography—i.e., the kind of 
cryptography that does not require any secrets to be shared in advance, and which 
is used for sending credit-card numbers over the web. To create a secure public-key 
cryptosystem, one needs something even stronger than Conjecture 26: for example, 
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a trapdoor OWF,”* which is an OWF with the additional property that it becomes 
easy to invert if one is given a secret “trapdoor” string generated along with the 
function. We do, of course, have candidates for secure public-key cryptosystems, 
which are based on problems such as factoring, discrete logarithms (over both 
multiplicative groups and elliptic curves), and finding short nonzero vectors in 
lattices. To date, however, all public-key cryptosystems require “sticking one’s neck 
out,’ and conjecturing the hardness of some specific NP-intermediate problem, 
something with much more structure than any known NP-complete problem. In 
other words, for public-key cryptography, today one needs additional conjectures 
that go fundamentally beyond P ¥ NP, or even the existence of OWFs. 


5.4 Randomized Algorithms 


Even assuming P 4 NP, we can still ask whether NP-complete problems can 
be solved in polynomial time with help from random bits. This is a different 
question than whether NP is hard on average: whereas before we were asking about 
algorithms that solve most instances (with respect to some distribution), now we are 
asking about algorithms that solve all instances, for most choices of some auxiliary 
random numbers. 

Historically, algorithm designers have often resorted to randomness, to deal with 
situations where most choices that an algorithm could make are fine, but any specific 
choice will lead to terrible behavior on certain inputs. For example, in Monte Carlo 
simulation, used throughout science and engineering, one estimates the volume of 
a high-dimensional object by just sampling random points, and then checking what 
fraction of them lie inside. 

A second example concerns primality testing: that is, deciding the language 


PRIMES = {NV : N is a binary encoding of a prime number} . 


In modern cryptosystems such as RSA, it is just as important that primality testing 
be easy as that the related factoring problem be hard. In the 1970s, Rabin [181] and 
Solovay and Strassen [210] showed how to decide PRIMES in time polynomial in 
log N (i.e., the number of digits of N). The small catch was that their algorithms 
were randomized: in addition to N, they required a second input r; and for each N, 
the algorithms were guaranteed to succeed for most r’s but not all of them. Miller 
[154] also proposed a deterministic polynomial-time algorithm for PRIMES, but 
could only prove the algorithm correct assuming the Extended Riemann Hypothesis. 
Finally, after decades of work on the problem, in 2002 Agrawal, Kayal, and Saxena 
[14] gave an unconditional proof that PRIMES is in P. In other words, if one only 


4There are closely-related objects, such as “lossy” trapdoor OWFs (see [178]), that also suffice 
for building public-key cryptosystems. 
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cares about testing primality in polynomial time, and not about the degree of the 
polynomial, then randomness was never needed after all. 

A third example concerns the problem of polynomial identity testing (PIT). Here 
we are given as input a circuit or formula, composed of addition and multiplication 
gates, that computes a polynomial p : F — F over a finite field F. The question is 
whether p is the identically-zero polynomial—that is, whether the identity p (x) = 0 
holds. If deg (p) « |IF|, then the Fundamental Theorem of Algebra immediately 
suggests a way to solve this problem: simply pick an x € F uniformly at random 
and check whether p(x) = 0. Since a nonzero polynomial p can vanish on at 
most deg (p) points, the probability that we will “get unlucky” and choose one of 
those points is at most deg (p) / |F|. To this day, no one knows of any deterministic 
approach that achieves similar performance,”> and derandomizing PIT is considered 
one of the frontier problems of theoretical computer science. For details, see for 
example the survey of Shpilka and Yehudayoff [205]. 


5.4.1 BPP and Derandomization 


What is the power of randomness more generally? Can every randomized algorithm 
be derandomized, as ultimately happened with PRIMES? To explore these issues, 
complexity theorists study several randomized generalizations of the class P. We 
will consider just one of them: Bounded-Error Probabilistic Polynomial-Time, or 
BPP, is the class of languages L © {0, 1}* for which there exists a polynomial-time 
Turing machine M, as well as a polynomial p, such that for all inputs x € {0, 1}", 


Pr [M(x,r) =L()] = 
re{o, 1? 


WIN 


In other words, for every x, the machine M must correctly decide whether x € L 
“most of the time” (that is, for most choices of r). Crucially, here we can easily 
replace the constant 2/3 by any other number between 1/2 and 1, or even by a 
function like 1 — 2~”. So for example, if we wanted to know x € L with 0.999999 
confidence, then we’d simply run M several times, with different independent values 
of r, and then output the majority vote among the results. 

It is clear that P C BPP C PSPACE; more interestingly, Sipser [206] and 
Lautemann [139] proved that BPP is contained in 4 al me (that is, the second level 
of PH). The Rabin-Miller and Solovay-Strassen algorithms imply that PRIMES € 
BPP. 


5 At least, not for arbitrary polynomials computed by small formulas or circuits. A great deal 
of progress has been made derandomizing PIT for restricted classes of polynomials. In fact, the 
deterministic primality test of Agrawal et al. [14] was based on a derandomization of one extremely 
special case of PIT. 
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Today, most complexity theorists conjecture that what happened to PRIMES can 
happen to all of BPP: 


Conjecture 28. P = BPP. 


The reason for this conjecture is that it follows from the existence of good enough 
pseudorandom generators, which we could use to replace the random string r in any 
BPP algorithm M by deterministic strings that “look random, as far as M can tell.” 
Furthermore, work in the 1990s showed that, if we grant certain extremely plausible 
lower bounds on circuit size, then these pseudorandom generators exist. Perhaps the 
most striking result along these lines is that of Impagliazzo and Wigderson [115]: 


Theorem 29 (Impagliazzo-Wigderson [115]). Suppose there exists a language 
decidable in 2" time, which requires nonuniform circuits of size 2°. Then P = 


BPP. 


Of course, if P = BPP, then the question of whether randomized algorithms can 


efficiently solve NP-complete problems is just the original P = NP question in a 
different guise. Ironically, however, the “obvious” approach to proving P = BPP is 
to prove a strong circuit lower bound—and if one knew how to do that, perhaps one 
could prove P ~ NP as well! 

Even if we don’t assume P = BPP, it’s easy to show that deterministic 
nonuniform algorithms (see Sect. 5.2) can simulate randomized algorithms: 


Proposition 30 (Adleman [11]). BPP c P/poly. 


Proof. Let the language L be decided by a BPP algorithm that uses p(n) random 
bits. Then by using g(n) = O(n- p(n)) random bits, running the algorithm O (n) 
times with independent random bits each time, and outputting the majority answer, 
we can push the probability of error on any given input x € {0, 1}” from 1/3 down 
to (say) 2-7”. Thus, the probability that there exists an x € {0,1}" on which the 
algorithm errs is at most 2” (a?) = 2". This means, in particular, that there must 
be a fixed choice for the random string r € {0, ya” that causes the algorithm to 
succeed on all x € {0, 1}". So to decide L in P/poly, we simply “hardwire” that r as 
the advice. a 


By combining Theorem 23 with Proposition 30, we immediately obtain that if 
NP C BPP, then the polynomial hierarchy collapses to the second level. So the 


bottom line is that the NP C BPP question is likely identical to the P = NP 
question, but is extremely tightly related even if not. 


5.5 Quantum Algorithms 


The class BPP might not exhaust what the physical world lets us efficiently 
compute, with quantum computing an obvious contender for going further. In 1993, 
Bernstein and Vazirani [42] defined the complexity class BQP, or Bounded-Error 
Quantum Polynomial-Time, as a quantum-mechanical generalization of BPP. 
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(Details of quantum computing and BQP are beyond the scope of this survey, but 
see [6, 172].) Bernstein and Vazirani, along with Adleman et al. [12], also showed 
some basic containments: 


P C BPP C BQP C PPC P* c PSPACE. 


In 1994, Shor [203] famously showed that the factoring and discrete logarithm 
problems are in BQP—and hence, that a scalable quantum computer, if built, could 
break almost all currently-used public-key cryptography. To design his quantum 
algorithms, Shor had to exploit extremely special properties of factoring and discrete 
logarithm, which are not known to hold for NP-complete problems. 


‘ 
The quantum analogue of the P = NP question is the question of whether NP C 

BQP: that is, can quantum computers solve NP-complete problems in polynomial 

time ?°° Most quantum computing researchers conjecture that the answer is no: 


Conjecture 31. NP ¢ BQP. 


Naturally, there is little hope of proving Conjecture 31 at present, since any proof 
would imply P # NP! We do not even know today how to prove conditional 
statements (analogous to what we have for BPP and P/poly): for example, that 
if NP C BQP then PH collapses. On the other hand, it is known that, if a fast 
quantum algorithm for NP-complete problems exists, then in some sense it will 
have to be extremely different from Shor’s or any other known quantum algorithm. 
For example, if we ignore the structure of NP-complete problems, and just consider 
the abstract task of searching an unordered list, then quantum computers can provide 
at most a square-root speedup over the classical running time [40]. This implies that 
there exists an oracle A such that NP“ ¢ BQP%. Note that the square-root speedup 
is actually achievable, using Grover’s algorithm [97]. For most NP-complete 
problems, however, the fastest known quantum algorithm will be obtained by simply 
layering Grover’s algorithm on top of the fastest known classical algorithm, yielding 
a quadratic speedup but no more.”’ So for example, as far as anyone knows today, 
even a quantum computer would need 2°” time to solve 3SAT. 


6One can also consider the QMA-complete problems, which are a quantum generalization of the 
NP-complete problems themselves (see [48]), but we will not pursue that here. 

?7One can artificially design an NP-complete problem with a superpolynomial quantum speedup 
over the best known classical algorithm by, for example, taking the language 


L= {0g0- --0 | g isa satisfiable 3Sat instance of size poly U 


{1x | x is a binary encoding of a positive integer with an odd number of distinct prime factors} . 


Clearly L is NP-complete, and a quantum algorithm can decide L in O (") time for some c, 


whereas the best known classical algorithm will take exp (7) time. 

Conversely, there are also NP-complete problems with no significant quantum speedup 
known—say, because the best known classical algorithm is based on dynamic programming, 
and it’s unknown how to combine that with Grover’s algorithm. A candidate example is the 
Traveling Salesman Problem, which is solvable in O(2" poly (n)) time using the Held-Karp 
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Of course, one can also wonder whether the physical world might provide com- 
putational resources even beyond quantum computing (based on black holes? closed 
timelike curves? modifications to quantum mechanics?), and if so, whether those 
resources might enable the polynomial-time solution of NP-complete problems. 
Such speculations are beyond the scope of this article, but see for example [3]. 


6 Progress 


One common view among mathematicians is that questions like P ms NP, while 
undoubtedly important, are just too hard to make any progress on in the present 
state of mathematics. It’s true that we seem to be nowhere close to a solution, but 
in this section, I'll build a case that the extreme pessimistic view is unwarranted. 
I'll explain what genuine knowledge I think we have, relevant to proving P 4 NP, 
that we didn’t have thirty years ago or in some cases ten years ago. One could argue 
that, if P 4 NP is a distant peak, then all the progress has remained in the foothills. 
On the other hand, scaling the foothills has already been highly nontrivial, so anyone 
who wants to work on P ¥ NP had better get acquainted with what’s been done. 

More concretely, I'll tell a story here of the interaction between lower bounds 
and barriers: on the one hand, actual successes in proving superpolynomial or 
exponential lower bounds in interesting models of computation; but on the other, 
explanations for why the techniques used to achieve those successes don’t extend to 
prove P ~ NP. We’ll see how the barriers influence the next generation of lower 
bound techniques, which are sometimes specifically designed to evade the barriers, 
or evaluated on their potential to do so. 

With a single exception—namely, the Mulmuley-Sohoni Geometric Complexity 
Theory program—I’Il restrict my narrative to ideas that have already had definite 
successes in proving limits on computation, beyond what had previously been 
known. The drawback of this choice is that in many cases, the ideas that are concrete 
enough to have worked for something, are also concrete enough that we understand 
why they can’t work for P #4 NP! My defense is that this section would be 
unmanageably long, if it had to cover every idea about how P 4 NP might someday 
be proved. 

I should, however, at least mention some important approaches to lower bounds 
that will be missing from my subsequent narrative. The first is descriptive complex- 
ity theory; see for example the book of Immerman [111] for a good introduction. 
Descriptive complexity characterizes many complexity classes in terms of their 
logical expressive power: for example, P corresponds to sentences expressible in 
first-order logic with linear order and a least fixed point; NP to sentences expressible 
in existential second-order logic; PSPACE to sentences expressible in second-order 


dynamic programming algorithm [107], whereas Grover’s algorithm seems to yield only the worse 


bound O(./n!). 
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logic with transitive closure; and EXP to sentences expressible in second-order 
logic with a least fixed point. The hope is that characterizing complexity classes 
in this way, with no explicit mention of resource bounds, will make it easier to 
see which are equal and which different. There is one major piece of evidence for 
this hope: namely, descriptive complexity played an important role in the proof by 
Immerman [110] that nondeterministic space is closed under complement (though 
the independent proof by Szelepcsényi [218] of the same result did not use these 
ideas). Descriptive complexity theory has not yet led to new separations. 

The second approach is lower bounds via communication complexity. Given a 
Boolean function f : {0, 1}” > {0, 1}, consider the following communication game: 
Alice receives an x such that f (x) = 0, Bob receives an input x such that f (x) = 1, 
and their goal is to agree on an index i € {1,...,} such that x; A y;. Let Cy be the 
communication complexity of this game: that is, the minimum number of bits that 
Alice and Bob need to exchange to win the game, if they use an optimal protocol 
(and where the communication cost is maximized over all x, y pairs). Then in 1990, 
Karchmer and Wigderson [121] showed the following remarkable connection. 


Theorem 32 (Karchmer-Wigderson [121]). For any f, the minimum depth of any 
Boolean circuit for f is equal to Cy. 


Combined with Proposition 24 (depth-reduction for formulas), Theorem 32 
implies that every Boolean function f requires formulas of size at least 2°: 
in other words, communication lower bounds imply formula-size lower bounds. 
Since communication complexity is a well-established area of theoretical computer 
science with many strong lower bounds (see for example the book by Kushilevitz 
and Nisan [134]), one might therefore hope that lower-bounding the communication 
cost of the “Karchmer-Wigderson game” would be a viable approach to proving 
NP ¢ NC! or P ¢ NC", either of which would constitute a huge step toward 
P #~ NP. 

See Sect. 6.2.2 for Karchmer and Wigderson’s applications of a similar connec- 
tion to monotone formula-size lower bounds. Also see Aaronson and Wigderson 
[10] for further connections between communication complexity and computational 
complexity—including even a “communication complexity lower bound” that if true 
would imply P # NP. Of course, the question is whether these translations merely 
shift the difficulty of complexity class separations to a superficially different setting, 
or whether they set the stage for genuinely new insights. 

The third approach is lower bounds via derandomization. In Sect.5.4.1, we 
discussed the discovery in the 1990s that, if sufficiently strong circuit lower bounds 
hold, then P = BPP: that is, every randomized algorithm can be made deterministic 
with only a polynomial slowdown. In the early 2000s, it was discovered that 
converse statements often hold as well: that is, derandomizations of randomized 
algorithms imply circuit lower bounds. Probably the most-cited result along these 
lines is that of Kabanets and Impagliazzo [118]: 
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Theorem 33 ({118]). Suppose the polynomial identity testing problem from 
Sect. 5.4 is in P. Then either NEXP ¢ P/poly, or else the permanent function 
has no polynomial-size arithmetic circuits (see Sect. 6.5.1). 


As usual, the issue is that it is not clear whether we should interpret this 
result as giving a plausible path toward proving circuit lower bounds (namely, by 
derandomizing PIT), or simply as explaining why derandomizing PIT will be hard 
(namely, because doing so will imply circuit lower bounds)! 

The fourth approach could be called lower bounds via “innocent-looking” 
combinatorics problems. Here is an example: given an n X n matrix A, say over 
the finite field F2, call A rigid if not only does A have rank Q (n), but any matrix 
obtained by changing O (n!/ 10) entries in each row of A also has rank Q (n). It is 
easy to show, via a counting argument, that almost all matrices A € F5*” are rigid. 
On the other hand, Valiant [224] made the following striking observation in 1977: if 
we manage to find any explicit example of a rigid matrix, then we also get an explicit 
example of a Boolean function that cannot be computed by any circuit of linear size 
and logarithmic depth. 

For another connection in the same spirit, given a 3-dimensional tensor A € 
Fy*"", let the rank of A be the smallest r such that A can be written as the sum 
of r rank-one tensors (that is, tensors of the form tj = x;yjzx). Then it is easy 
to show, via a counting argument, that almost all tensors A € F5*""" have rank 
Q (n°). On the other hand, Strassen [215] observed in 1973 that, if we find any 
explicit example of a 3-dimensional tensor with rank r, then we also get an explicit 
example of a Boolean function with circuit complexity Q (r). ** Alas, proving that 
any explicit matrix is rigid, or that any explicit tensor has superlinear rank, have 
turned out to be staggeringly hard problems—as perhaps shouldn’t surprise us, given 
the implications for circuit lower bounds! 

The rest of the section is organized as follows: 


¢ Section 6.1 covers logical techniques, which typically fall prey to the relativiza- 
tion barrier. 

¢ Section 6.2 covers combinatorial techniques, which typically fall prey to the 
natural proofs barrier. 

¢ Section 6.3 covers “hybrid” techniques (logic plus arithmetization), many of 
which fall prey to the algebrization barrier. 


8Going even further, Raz [184] proved in 2010 that, if we manage to show that any explicit d- 
dimensional tensor A : InJ — F has rank at least n“'—°), then we’ve also shown that the n X n 
permanent function has no polynomial-size arithmetic formulas. It’s easy to construct explicit d- 
dimensional tensors with rank nl4/2J , but the current record is an explicit d-dimensional tensor 
with rank at least 2nL4/2] + n— oO (dlogn) [18]. 

Note that, if we could show that the permanent had no nO (log ") -size arithmetic formulas, that 
would imply Valiant’s famous Conjecture 66: that the permanent has no polynomial-size arithmetic 
circuits. However, Raz’s technique seems incapable of proving formula-size lower bounds better 


than n? (log log n) . 
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e Sections 6.4 covers “ironic complexity theory” (as exemplified by the recent 
work of Ryan Williams), or the use of nontrivial algorithms to prove circuit lower 
bounds. 

¢ Section 6.5 covers arithmetic circuit lower bounds, which probably fall prey to 
arithmetic variants of the natural proofs barrier (though this remains disputed). 

¢ Section 6.6 covers Mulmuley and Sohoni’s Geometric Complexity Theory 


(GCT), an audacious program to tackle P = NP and related problems by 
reducing them to questions in algebraic geometry and representation theory (and 
which is also an example of “ironic complexity theory”). 


Note that, for the approaches covered in Sects. 6.4 and 6.6, no formal barriers are 
yet known. 


6.1 Logical Techniques 


In the 1960s, Hartmanis and Stearns [102] realized that, by simply “scaling down” 
Turing’s diagonalization proof of the undecidability of the halting problem, one can 
at least prove some separations between complexity classes. In particular, one can 
generally show that more of the same resource (time, memory, etc.) lets one decide 
more languages than less of that resource. Here is a special case of their so-called 
Time Hierarchy Theorem. 


Theorem 34 (Hartmanis-Stearns [102]). P 4 EXP. 
Proof. Let 


L = {((M) ,x,0") : M (x) halts in at most 2” steps}. 


Clearly L € EXP. On the other hand, suppose by contradiction that L €¢ P. Then 
there is some polynomial-time Turing machine A such that A (z) accepts if and only 
if z € L. LetA runin p(n + |(M)| + |x|) time. Then using A, we can easily produce 
another machine B that does the following: 


¢ Takes input ((M) , 0”). 
¢ Runs forever if M ((M) ,0") halts in at most 2” steps; otherwise halts. 


Note that, if B halts at all, then it halts after only p (2n + 2|(M)|) = n° steps. 
Now consider what happens when B is run on input ((B) ,0”). If B((B) ,0") 
runs forever, then B((B) ,0”) halts. Conversely, if B((B) ,0”) halts, then for all 
sufficiently large n, it halts in fewer than 2” steps, but that means that B ((B) , 0”) 
runs forever. So we conclude that B, and hence A, cannot have existed. |_| 


More broadly, the same argument shows that there are languages decidable in 
O (n’) time but not in O (n) time, in O (n°) time but not in O (n’) time, and so on for 
almost every natural pair of runtime bounds. (Technically, we have TIME (f (n)) 4 
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TIME (g (n)) for every f, g that are time-constructible—that is, there exist Turing 
machines that run for f (m) and g (n) steps given n as input—and that are separated 
by more than a log n multiplicative factor.) Likewise, the Space Hierarchy Theorem 
shows that there are languages decidable in O(f (n)) space but not in O (g (n)) 
space, for all natural f(m) >> g(n). Cook [68] also proved a hierarchy theorem 
for the nondeterministic time classes, which will play an important role in Sect. 6.4: 


Theorem 35 (Nondeterministic Time Hierarchy Theorem [68]). For all time- 
constructible f,g such that f(n+1) = o(g(n)), we haveNTIME (f(n)) 4 
NTIME (g (n)). 


One amusing consequence of the hierarchy theorems is that we know, for exam- 
ple, that P 4 SPACE (n), even though we don’t know either that P ¢ SPACE (n) or 
that SPACE (n) ¢ P! For suppose by contradiction that P = SPACE (n). Then 
by a padding argument (cf. Proposition 17), P would also contain SPACE n°), 
and therefore equal SPACE (n°). But then we’d have SPACE (n) = SPACE n), 
violating the Space Hierarchy Theorem. 

In summary, there really is an infinite hierarchy of harder and harder computable 
problems. Complexity classes don’t collapse in the most extreme ways imaginable, 
with (say) everything solvable in linear time. 


6.1.1 Circuit Lower Bounds Based on Counting 


A related idea—not exactly “diagonalization,’ but counting arguments made 
explicit—can also be used to show that certain problems cannot be solved by 
polynomial-size circuits. This story starts with Claude Shannon [201], who made 
the following fundamental observation in 1949. 


Proposition 36 (Shannon [201]). There exists a Boolean function f : {0,1}" > 
{0, 1}, on n variables, such that any circuit to compute f requires at least Q2 (2"/n) 
logic gates. Indeed, almost all Boolean functions on n variables (that is, a 1 — 0 (1) 
fraction of them) have this property.”° 


Proof. There are 27" different Boolean functions f on n variables, but only 


E(t) caane 
t=1 


different Boolean circuits with n inputs and at most T NAND gates. Since each 
circuit can only represent one function, and since (n + Ty" =o (27) when T = 
o (2"/n), it follows by a counting argument (i.e., the pigeonhole principle) that some 


°With some effort, Shannon’s lower bound can be shown to be tight: that is, every n-variable 
Boolean function can be represented by a circuit of size O (2"/n). (The obvious upper bound is 
O (n2").) 
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f must require a circuit with T = Q (2”/n) NAND gates—and indeed, that almost 
all of the 27" possible f’s must have this property. The number of AND, OR, and 
NOT gates required is related to the number of NAND gates by a constant factor, so 
is also Q (2"/n). | 


Famously, Proposition 36 shows that there exist Boolean functions that require 
exponentially large circuits—in fact, that almost all of them do—yet it fails to 
produce a single example of such a function! It tells us nothing whatsoever about 
3SAT or CLIQUE or any other particular function that might interest us. In that 
respect, it is similar to Shannon’s celebrated proof that almost all codes are good 
error-correcting codes, which also fails to produce a single example of such a 
code. Just like, in the decades after Shannon, the central research agenda of coding 
theory was to “make Shannon’s argument explicit” by finding specific good error- 
correcting codes, so too the agenda of circuit complexity has been to “make 
Proposition 36 explicit” by finding specific functions that provably require large 
circuits. 

In some cases, the mere fact that we know, from Proposition 36, that hard 
functions exist lets us “bootstrap” to show that particular complexity classes must 
contain hard functions. Here is an example of this. 


Theorem 37. EXPSPACE ¢ P/poly. 


Proof. Let n be sufficiently large. Then by Proposition 36, there exist functions 
f : {0, 1}” — {0, 1} with circuit complexity at least c2”/n, for some constant c > 0. 
Thus, if we list all the 27" functions in lexicographic order by their truth tables, there 
must be a first function in the list, call it f,, with circuit complexity at least c2”/n. 
We now define 


L:= | \(ee 1 2,@ =i}. 


n>1 


Then by construction, L ¢ P/poly. On the other hand, enumerating all n-variable 
Boolean functions, calculating the circuit complexity of each, and finding the first 
one with circuit complexity at least c2”/n can all be done in exponential space. 
Hence L € EXPSPACE. (1 


There is also a “scaled-down version” of Theorem 37, proved in the same way: 


Theorem 38. For every fixed k, there is a language in PSPACE that does not have 


circuits of size n.*° 


By being a bit more clever, Kannan [120] lowered the complexity class in 
Theorem 37 from EXPSPACE to NEXP'. 


Theorem 39 (Kannan [120]). NEXP” ¢ P/poly. 


Crucially, this will be a different language for each k; otherwise we would get PSPACE ¢ 
P/poly, which is far beyond our current ability to prove. 
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Proof. First, we claim that EXP? a Z P/poly. The reason is simply a more careful 


version of the proof of Theorem 37: in EXpNpN , we can do an explicit binary search 
for the lexicographically first Boolean function f, : {0, 1}” — {0, 1} such that every 
circuit of size at most (say) c2”/n disagrees with f,, on some input x. (Such an f, 
must exist by a counting argument.) 

Next, suppose by contradiction that NEXP“? Cc P/poly. Then certainly NP C 
P/poly. By the Karp-Lipton Theorem (Theorem 23), this implies that PH = ©5, so 
in particular PNP “= NPNP, By upward translation (as in Proposition 17), this in 
turn means that EXPNP™ = NEXPY?. But we already know that EXPN’ does not 
have polynomial-size circuits, and therefore neither does NEXPNP. fa] 


Amusingly, if one works out the best possible lower bound that one can get from 
Theorem 39 on the circuit complexity of a language in NEXPN?, it turns out to be 
half-exponential: that is, a function f such that f (f (7)) grows exponentially. Such 
functions exist, but have no closed-form expressions. 

Directly analogous to Theorem 38, a “scaled-down” version of the proof of 
Theorem 39 shows that, for every fixed k, there is a language in p24 = NP"? that 
does not have circuits of size n*. 

In Sect. 6.3, we will discuss slight improvements to these results that can be 
achieved with algebraic methods. Nevertheless, it (sadly) remains open even to show 
that NEXP ¢ P/poly, or that there is a language in NP that does not have linear- 
sized circuits. 


6.1.2 The Relativization Barrier 


The magic of diagonalization, self-reference, and counting arguments is how 
abstract and general they are: they never require us to “get our hands dirty” by 
understanding the inner workings of algorithms or circuits. But as was recognized 
early in the history of complexity theory, the price of generality is that the logical 
techniques are extremely limited in scope. 

Often the best way to understand the limits of a proposed approach for proving 
a statement S, is to examine what else besides S the approach would prove if 
it worked—i.e., which stronger statements S’ the approach “fails to differentiate” 
from S. If any of the stronger statements are false, then the approach cannot prove S 
either. 

That is exactly what Baker et al. [34] did for diagonalization in 1975, when they 
articulated the relativization barrier. Their central insight was that almost all the 
techniques we have for proving statements in complexity theory—such as C C D 
or C ¢ D, where C and D are two complexity classes—are so general that, if they 
work at all, then they actually prove C4 C D4 or C4 ¢ D4 for all possible oracles A. 
In other words: if all the machines that appear in the proof are enhanced in the same 
way, by being given access to the same oracle, the proof is completely oblivious to 
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that change, and goes through just as before. A proof with this property is said to 
“relativize,’ or to hold “in all possible relativized worlds” or “relative to any oracle.” 

Why do so many proofs relativize? Intuitively, because the proofs only do things 
like using one Turing machine M, to simulate a second Turing machine M) step- 
by-step, without examining either machine’s internal structure. In that case, if M2 is 
given access to an oracle A, then Mj can still simulate Mp just fine, provided that M, 
is also given access to A, in order to simulate M’s oracle calls. 

To illustrate, the reader might want to check that the proofs of Theorems 37, 38, 
and 39 can be straightforwardly modified to show that, more generally: 


« P4 4 EXP% for all oracles A. 
* EXPSPACE* ¢ P4/poly. 


Alas, Baker, Gill, and Solovay then observed that no relativizing technique can 
possibly resolve the P = NP question. For, unlike (say) P = EXP or the unsolv- 


ability of the halting problem, P = NP admits “contradictory relativizations”: there 
are some oracle worlds where P = NP, and others where P # NP. For that reason, 
any proof of P # NP will need to “notice,” at some point, that there are no oracles 
in “our” world: it will have to use techniques that fail relative to certain oracles. 


Theorem 40 (Baker-Gill-Solovay [34]). There exists an oracle A such that PA = 
NP4, and another oracle B such that P® x NP2. 


Proof Sketch. To make P4 = NP*, we can just let A be any PSPACE-complete 
language. Then it is not hard to see that PA’ = NP“ = PSPACE. 

To make P? 4 NP®, we can (for example) let B be a random oracle, as observed 
by Bennett and Gill [41]. We can then, for example, define 


L= { 0” : the first 2” bits of B contain a run of n consecutive 1's} : 


Clearly L ¢ NP®. By contrast, one can easily that L ¢ P® with probability 1 over 
B: in this case, there really is nothing for a deterministic Turing machine to do but 
brute-force search, requiring exponentially many queries to the B oracle. a 


We also have the following somewhat harder result. 


Theorem 41 (Wilson [243]). There exists an oracle A such that NEXP4 c 
P4/poly, and such that every language in NP“ has linear-sized circuits with A- 
oracle gates (that is, gates that query A). 


In other words, any proof even of NEXP ¢ P/poly—that is, of a circuit lower 
bound just “slightly” beyond those that have already been proven—will require non- 
relativizing techniques. One can likewise show that non-relativizing techniques will 
be needed to make real progress on many of the other open problems of complexity 
theory (such as proving P = BPP). 

If the relativization barrier seems too banal, the way to appreciate it is to try to 
invent techniques, for proving inclusions or separations among complexity classes, 
that fail to relativize. It’s harder than it sounds! A partial explanation for this was 
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given by Arora et al. [28], who reinterpreted the relativization barrier in logical 
terms. From their perspective, a relativizing proof is simply any proof that “knows” 
about complexity classes, only through axioms that assert the classes’ closure 
properties, as well as languages that they do contain (for example, P contains the 
empty language; if L; and Ly are both in P, then so are Boolean combinations like L; 
and L; NL»). These axioms can be shown to imply statements such as P ~ EXP. But 
other statements, like P 4 NP, can be shown to be independent of the axioms, by 
constructing models of the axioms where those statements are false. One constructs 
those models by using oracles to “force in” additional languages—such as PSPACE- 
complete languages, if one wants a world where P = NP—which the axioms might 
not require to be contained in complexity classes like P and NP, but which they 
don’t prohibit from being contained, either. The conclusion is that any proof of 
P ~ NP will need to appeal to deeper properties of the classes P and NP, properties 
that don’t follow from these closure axioms. 


6.2. Combinatorial Lower Bounds 


Partly because of the relativization barrier, in the 1980s attention shifted to combi- 
natorial approaches: that is, approaches where one tries to prove superpolynomial 
lower bounds on the number of operations of some kind needed to do something, 
by actually “rolling up one’s sleeves” and delving into the messy details of what 
the operations do (rather than making abstract diagonalization arguments). These 
combinatorial approaches enjoyed some spectacular successes, some of which 
seemed at the time like they were within striking distance of proving P # NP. 
Let us discuss some examples. 


6.2.1 Proof Complexity 


Suppose we are given a 3SAT formula g, and we want to prove that @ has 
no satisfying assignments. One natural approach to this is called resolution: we 
repeatedly pick two clauses of g, and then “resolve” the clauses (or “smash them 
together’) to derive a new clause that logically follows from the first two. This is 
most useful when one of the clauses contains a non-negated literal x, and the other 
contains the corresponding negated literal x. For example, from the clauses (x V y) 
and (x V z), it is easily seen that we can derive (y V z). The new derived clause can 
then be added to the list of clauses, and used as an input to future resolution steps. 

Now, if we ever derive the empty clause ()—say, by smashing together (x) 
and (x)—then we can conclude that our original 3SAT formula g must have been 
unsatisfiable. For in that case, g entails a clause that’s not satisfied by any setting 
of variables. Another way to say this is that resolution is a sound proof system. 
By doing an induction on the number of variables in ¢, it’s not hard to show that 
resolution is also complete: 
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Proposition 42. Resolution is a complete proof system for the unsatisfiability of 
KSAT. In other words, given any unsatisfiable kSAT formula @, there exists some 
sequence of resolution steps that produces the empty clause. 


So the key question about resolution is just how many resolution steps are needed 
to derive the empty clause, starting from an unsatisfiable formula g. If that number 
could be upper-bounded by a polynomial in the size of ¢, it would follow that NP = 
coNP. If, moreover, an appropriate sequence of resolutions could actually be found 
in polynomial time, it would follow that P = NP. 

On the other hand, when one proves completeness by induction on the number 
of variables n, the only upper bound one gets on the number of resolution steps is 
2”. And indeed, in 1985, Haken proved the following celebrated result. 


Theorem 43 (Haken [101]). There exist kSAT formulas, involving n° variables 
and clauses, for which any resolution proof of unsatisfiability requires at least 2°” 
resolution steps. An example is a kSAT formula that explicitly encodes the “nth 
Pigeonhole Principle”: that is, the statement that is no way to map n + | pigeons 
into n holes, without mapping two pigeons to the same hole. 


Haken’s nontrivial proof formalized the intuition that any resolution proof of the 
Pigeonhole Principle will ultimately be stuck “reasoning locally”: “let’s see, if I put 
this pigeon there, and that one there ... darn, it stild doesn’t work!” Such a proof has 
no ability to engage in higher-level reasoning about the total number of pigeons. 

Since Haken’s breakthrough, there have been many other exponential lower 
bounds on the sizes of unsatisfiability proofs, typically for proof systems that 
generalize resolution in some way (see Beame and Pitassi [36] for a good survey). 
These, in turn, often let us prove exponential lower bounds on the running 
times of certain kinds of algorithms. For example, there is a widely-used class 
of KSAT algorithms called DPLL (Davis-Putnam-Logemann-Loveland) algorithms 
[72], which are based on pruning the search tree of possible satisfying assignments. 
DPLL algorithms have the property that, if one looks at the search tree of their 
execution on an unsatisfiable KSAT formula g, one can read off a resolution proof 
that ¢ is unsatisfiable. From that fact, together with Theorem 43, it follows that there 
exist KSAT formulas (for example, the Pigeonhole Principle formulas) for which any 
DPLL algorithm requires exponential time. 

In principle, if one could prove superpolynomial lower bounds for arbitrary 
proof systems (constrained only by the proofs being checkable in polynomial 
time), one would get P 4 NP, and even NP 4 coNP! However, perhaps this 
motivates turning our attention to lower bounds on circuit size, which tend to be 
somewhat easier than the analogous proof complexity lower bounds, and which— 
if generalized to arbitrary Boolean circuits—would “merely” imply P 4 NP and 
NP ¢ P/poly, rather than NP 4 coNP. 
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6.2.2. Monotone Circuit Lower Bounds 


Recall, from Sect. 5.2, that if we could merely prove that any family of Boolean 
circuits to solve some NP problem required a superpolynomial number of AND, 
OR, and NOT gates, then that would imply P 4 NP, and even the stronger result 
NP ¢ P/poly (that is, NP-complete problems are not efficiently solvable by 
nonuniform algorithms). 

Now, some NP-complete languages L have the interesting property being 
monotone: that is, changing an input bit from 0 to | can change the answer from 
x € Ltox e€ L, but never from x € Ltox ¢ L. An example is the CLIQUE 
language: say, the set of all encodings of n-vertex graphs G, as adjacency matrices 
of Os and 1s, such that G contains a clique on at least ./n vertices. It’s not hard to see 
that one can decide any such language using a monotone circuit: that is, a Boolean 
circuit of AND and OR gates only, no NOT gates. For the CLIQUE language, for 
example, a circuit could simply consist of an OR of Wr) ANDs, one for each 
possible clique. It thus becomes interesting to ask what are the smallest monotone 
circuits for monotone NP-complete languages. 

In 1985, Alexander Razborov, then a graduate student, astonished the complexity 
theory world with the following result. 


Theorem 44 (Razborov [187]). Any monotone circuit for CLIQUE requires at least 
n@l8) gates. 


Subsequently, Alon and Boppana [25] improved this, to show that any 
monotone circuit to detect a clique of size ~ (n/log ny! 7 must have size 
exp (2 (( / log n)i3 )). I won’t go into the proof of Theorem 44 here, but it uses 
beautiful combinatorial techniques, including (in modern versions) the Erdds-Rado 
sunflower lemma. 

The significance of Theorem 44 is this: if we could now merely prove that any 
circuit for a monotone language can be made into a monotone circuit without much 
increasing its size, then we’d immediately get P 4 NP and even NP ¢ P/poly. 
And indeed, this was considered a potentially-viable approach to proving P 4 NP 
for some months. Alas, the approach turned out to be a dead end, for the following 
reason. 


Theorem 45 ([188, 220]). There are monotone languages even in P that require 
exponentially-large monotone circuits. An example is the MATCHING language, 
consisting of all adjacency-matrix encodings of n-vertex graphs that admit a 
matching on at least ~ (n/ log ny?! > vertices. This language requires monotone 
circuits of size exp (2 ((n/ log n)'/3)), 

Thus, while Theorem 44 stands as a striking example of the power of combina- 
torics to prove circuit lower bounds, ultimately it tells us not about the hardness 
of NP-complete problems, but only about the weakness of monotone circuits. 
Theorem 45 implies that, even if we are trying to compute a monotone Boolean 
function (such as the MATCHING function), allowing ourselves the non-monotone 
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NOT gate can yield an exponential reduction in circuit size. Alas, Razborov’s 
techniques break down completely as soon as a few NOT gates are available.*! 

I should also mention lower bounds on monotone depth. In the STCON (s, f- 
connectivity) problem, we’re given as input the adjacency matrix of an undirected 
graph, and asked whether or not there is a path between two designated vertices 
s and t. By using their connection between circuit depth and communication 
complexity (see Sect. 6), Karchmer and Wigderson [121] were able to prove that any 
monotone circuit for STCON requires Q (log? n) depth—and as a consequence, that 
any monotone formula for STCON requires n°“'°S” size, Since STCON is known to 
have monotone circuits of polynomial size, this implies in particular that monotone 
formula size and monotone circuit size are not polynomially related. 


6.2.3 Small-Depth Circuits and the Random Restriction Method 


Besides restricting the allowed gates (say, to AND and OR only), there’s a second 
natural way to “hobble” a circuit, and thereby potentially make it easier to prove 
lower bounds on circuit size. Namely, we can restrict the circuit’s depth, the number 
of layers of gates between input and output. If the allowed gates all have a “fanin” 
of 1 or 2 (that is, they all take only | or 2 input bits), then clearly any circuit that 
depends nontrivially on all n of the input bits must have depth at least log, n. On the 
other hand, if we allow gates of unbounded fanin—for example, ANDs or XORs or 
MAJORITYs on unlimited numbers of inputs—then it makes sense to ask what can 
be computed even by circuits of constant depth. Constant-depth circuits are very 
closely related to neural networks, which also consist of a small number of layers 
of “logic gates” (i.e., the neurons), with each neuron allowed to have very large 
“fanin”—1.e., to accept input from many or all of the neurons in the previous layer. 

If we don’t also restrict the number of gates or neurons, then it turns out that 
every function can be computed in small depth: 


Proposition 46. Every Boolean function f : {0, 1}" — {0,1} can be computed by 
an unbounded-fanin, depth-3 circuit of size O (n2"): namely, by an OR of ANDs of 
input bits and their negations. 


Proof. We simply need to check whether the input, x € {0,1}”", is one of the z’s 
such that f (z) = 1: 


3!Note that, if we encode the input string using the so-called dual-rail representation—in which 
every 0 is represented by the 2-bit string 01, and every 1 by 10—then the monotone circuit 
complexities of CLIQUE, MATCHING, and so on do become essentially equivalent to their non- 
monotone circuit complexities, since we can push all the NOT gates to the bottom layer of the 
circuit using de Morgan’s laws. Unfortunately, Razborov’s lower bound techniques also break 
down under dual-rail encoding. 


f(x) = VV /\ xi] A A Xj 


Z=Z "Zn 2 f(D=1 i€{1,....n} 2 g=1 i€{1,....n} : 7=0 


Similarly, in typical neural network models, every Boolean function can be 
computed by a network with ~ 2” neurons arranged into just two layers. 

So the interesting question is what happens if we restrict both the depth and the 
number of gates or neurons. More formally, let AC° be class of languages L C 
{0, 1}* for which there exists a family of circuits {C,,} one for each input size n, 
such that: 


(1) C, (x) outputs 1 if x € Land 0 if x ¢ L, for all n and x € {0, 1}". 

(2) Each C,, consists of unbounded-fanin AND and OR gates, as well as NOT gates. 
(3) There is a polynomial p such that each C,, has at most p (n) gates. 

(4) There is a constant d such that each C,, has depth at most d. 


n>1? 


Clearly AC° is a subclass of P/poly; indeed we recover P/poly by omitting 
condition (4). Now, one of the major triumphs of complexity theory in the 1980s 
was to understand AC°, as we still only dream of understanding P/poly. It’s not just 
that we know NP ¢ AC°; rather, it’s that we know in detail which problems are and 
aren’t in AC° (even problems within P), and exactly how many gates are needed 
for each given depth d. As the most famous example, let PARITY be the language 
consisting of all strings with an odd number of ‘1’ bits. Then: 


Theorem 47 (Ajtai [16], Furst-Saxe-Sipser [85]). PARITY is not in AC 


While the original lower bounds on the size of AC° circuits for PARITY were only 
slightly superpolynomial, Theorem 47 was subsequently improved by Yao [245] and 
then by Hastad [103], the latter of whom gave an essentially optimal result: namely, 
any AC? circuit for PARITY of depth d requires at least 2a?) gates. 

The first proofs of Theorem 47 used what is called the method of random 
restrictions. In this method, we assume by contradiction that we have a size-s, depth- 
d, unbounded-fanin circuit C for our Boolean function—say, the PARITY function. 
We then randomly fix most of the input bits to 0 or 1, while leaving a few input 
bits unfixed. What we hope to find is that the random restriction “kills off” an entire 
layer of gates—because any AND gate that takes even one constant 0 bit as input 
can be replaced by the constant 0 function, and likewise, any OR gate that takes 
even one | bit as input can be replaced by the constant | function. Thus, any AND 
or OR gate with a large fanin is extremely likely to be killed off; gates with small 
fanin might not be killed off, but can be left around to be dealt with later. We then 
repeat this procedure, randomly restricting most of the remaining unfixed bits, in 
order to kill off the next higher layer of AND and OR gates, and so on through all 
d layers. By the time we are done, we have reduced C to a shadow of its former 
self: specifically, to a circuit that depends on only a constant number of input bits. 
Meanwhile, even though only a tiny fraction of the input bits (say, n'/4 of them) 
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remain unfixed, we still have a nontrivial Boolean function on those bits: indeed, it 
is easy to see that any restriction of the PARITY function to a subset S of bits will 
either be PARITY itself, or else NOT(PARITY). But a circuit of constant size clearly 
can’t compute a Boolean function that depends on ~ n!4 input bits. This yields our 
desired contradiction. 

At a high level, there were three ingredients needed for the random restriction 
method to work. First, the circuit needed to built out of AND and OR gates, which 
are likely to get killed off by random restrictions. The method would not have 
worked if the circuit contained unbounded-fanin MAJORITY gates (as a neural 
network does), or even unbounded-fanin XOR gates. Second, it was crucial that the 
circuit depth d was small, since we needed to shrink the number of unfixed input 
variables by a large factor d times, and then still have unfixed variables left over. It 
turns out that random restriction arguments can yield some lower bound whenever 


d=o0 (ee -), but not beyond that. Third, we needed to consider a function, such 
as PARITY, that remains nontrivial even after the overwhelming majority of input 
bits have been randomly fixed to 0 or 1. The method wouldn’t have worked, for 
example, for the n-bit AND function (which is unsurprising, since the AND function 
does have a depth-1 circuit, consisting of a single AND gate!). 

The original proofs for PARITY ¢ AC° have been generalized and improved 
on in many ways. For example, Linial et al. [144] examined the weakness of 
AC° circuits from a different angle: “turning lemons into lemonade,” they gave 
a quasipolynomial-time algorithm to /earn arbitrary AC° circuits with respect to 
the uniform distribution over inputs. Also, proving a conjecture put forward by 
Linial and Nisan [145] (and independently Babai), Braverman [51] showed that AC 
circuits cannot distinguish the outputs of a large range of pseudorandom generators 
from truly random strings. 

Meanwhile, Hastad [103] showed that for every d, there are functions computable 
by AC° circuits of depth d that require exponentially many gates for AC° circuits of 
depth d — 1. This implies that there exists an oracle relative to which PH is infinite. 
Improving that result, Rossman, Servedio, and Tan [193] very recently showed that 
the same functions Hastad had considered require exponentially many gates even 
to approximate using AC° circuits of depth d — 1. This implies that PH is infinite 
relative to a random oracle with probability 1, resolving a 30-year-old open problem. 

The random restriction method has also had other applications in complexity 
theory, besides to AC°. Most notably, it’s been used to prove polynomial lower 
bounds on formula size. The story of formula-size lower bounds starts in 1961 with 
Subbotovskaya [216], who used random restrictions to show that the n-bit PARITY 
function requires formulas of size Q (n'°). Later Khrapchenko [128] improved this 
to Q (n’), which is tight.°? Next, in 1987, Andreev [26] constructed a different 


32 Assume for simplicity that n is a power of 2. Then x; @ --- @ x, can be written as y ® z, where 
y= x1 +++ @ Xy/2 and Z2= Xn/241 @ +++ ® Xy. This in turn can be written as (y AZ) V (¥ A z). 
Expanding recursively now yields a size-n? formula for PARITY, made of AND, OR, and NOT 
gates. 
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Boolean function in P that could be shown, again using random restrictions, to 
require formulas of size n?>~°). This was subsequently improved to n7>~?) by 
Impagliazzo and Nisan [114], to n*-° by Paterson and Zwick [176], and finally 


to 3-9) by Hastad [104] and to Q (Se) by Tal [219]. Unfortunately, 
the random restriction method seems fundamentally incapable of going beyond 
Q (n’). On the other hand, for Boolean circuits rather than formulas, we still have 


no lower bound better than /inear for any function in P (or for that matter, in NP).! 


6.2.4 Small-Depth Circuits and the Polynomial Method 


For our purposes, the most important extension of Theorem 47 was achieved by 
Smolensky [209] and Razborov [189] in 1987. Let AC° [m] be the class of languages 
decidable by a family of constant-depth, polynomial-size, unbounded-fanin circuits 
with AND, OR, NOT, and MOD-m gates (gates output | if their number of ‘1’ input 
bits is divisible by m, and 0 otherwise). Then Smolensky and Razborov extended 
the class of circuits for which lower bounds can be proven from AC° to AC° [p], 
whenever p is prime. 


Theorem 48 (Smolensky [209], Razborov [189]). Let p and q be distinct primes. 
Then MOD,, the set of all strings with Hamming weight divisible by q, is not in 
AC? [p]. Indeed, any AC° [p] circuit for MOD, of depth d requires 22(nl™) gates. 
As a corollary, the MAJORITY function is also not in AC° [p], and also requires 
22(n/™/) gates to compute using AC° [p] circuits of depth d. 


It is not hard to show that AC° [p] = AC° [p*| for any k > 1, and thus, one also 
gets lower bounds against AC° [m], whenever m is a prime power. 

The proof of Theorem 48 uses the so-called polynomial method. Here one 
argues that, if a function f can be computed by a constant-depth circuit with AND, 
OR, NOT, and MOD-p gates, then f can also be approximated by a low-degree 
polynomial over the finite field F’,. One then shows that a function of interest, such 
as the MOD, function (for g 4 p), can’t be approximated by any such low-degree 
polynomial. This provides the desired contradiction. 

The polynomial method is famously specific in scope: it’s still not known how 
to generalize the method even to AC° [m] circuits, where m is not a prime power. 
The reason why it breaks down there is simply that there are no finite fields of non- 
prime-power order. And thus, to take an example, it’s still open whether the n-bit 
MAJORITY function has a constant-depth, polynomial-size, unbounded-fanin circuit 
consisting of AND, OR, NOT, and MOD-6 gates (or even entirely of MOD-6 gates)! 

Stepping back, it’s interesting to ask whether the constant-depth circuit lower 
bounds evade the relativization barrier explained in Sect.6.1.2. There’s some 
disagreement about whether it’s even sensible to feed oracles to tiny complexity 
classes such as AC° (see Allender and Gore [22] for example). However, to whatever 
extent it is sensible, the answer is that these lower bounds do evade relativization. 
For example, if by (Ac°)*, we mean AC° extended by “oracle gates” that query A, 
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then it’s easy to construct an A such that (Ac°)* = P4: for example, any A that 
is P-complete under AC°-reductions will work. On the other hand, we know from 
Theorem 47 that AC° ¥ P in the “real,” unrelativized world. 


6.2.5 The Natural Proofs Barrier 


Despite the weakness of AC° and AC° [mm] circuits, the progress on lower bounds for 
them suggested what seemed to many researchers like a plausible path to proving 
NP ¢ P/poly, and hence P 4 NP. That path is simply to generalize the random 
restriction and polynomial methods further and further, to get lower bounds for 
more and more powerful classes of circuits. The first step, of course, would be to 
generalize the polynomial method to handle AC° [m] circuits, where m is not a prime 
power. Then one could handle what are called TC° circuits: that is, constant-depth, 
polynomial-size, unbounded-fanin circuits with MAJORITY gates (or, as in a neural 
network, threshold gates, which output | if a certain weighted affine combination of 
the input bits exceeds 0, and 0 otherwise). Next, one could aim for polynomial-size 
circuits of logarithmic depth: that is, the class NC!. Finally, one could push all the 
way to polynomial-depth circuits: that is, the class P/poly. 

Unfortunately, we now know that this path hits a profound barrier at TC°, if not 
earlier—a barrier that explains why the random restriction and polynomial methods 
haven’t taken us further toward a proof of P # NP. Apparently this barrier was 
known to some experts in the 1980s, but it was first articulated in print in 1993 by 
Razborov and Rudich [191], who called it the natural proofs barrier. 

The basic insight is that combinatorial techniques, such as the method of random 
restrictions, do more than advertised: in some sense, they do too much for their own 
good. In particular, not only do they let us show that certain specific functions, like 
PARITY, are hard for AC°; they even let us certify that a random function is hard 
for AC°. Indeed, such techniques give rise to an algorithm, which takes as input the 
truth table of a Boolean function f : {0, 1}” — {0,1}, and which has the following 
two properties. 


(1) “Constructivity.” The algorithm runs in time polynomial in the size of f’s truth 
table (that is, polynomial in 2”). 

(2) “Largeness.” If f is chosen uniformly at random, then with probability at least 
1/n°™ over f, the algorithm certifies that f is hard (i.e., that f is not in some 
circuit class C, such as AC° in the case of the random restriction method). 


If a lower bound proof gives rise to an algorithm satisfying (1) and (2), then 
Razborov and Rudich call it a natural proof. In many cases, it’s not entirely obvious 
that a lower bound proof is natural, but with some work one can show that it is. To 
illustrate, in the case of the random restriction method, the algorithm could check 
that f has a large fraction of its Fourier mass on high-degree Fourier coefficients, or 
that f has high “average sensitivity” (that is, if x and y are random inputs that differ 
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only in a single bit, then with high probability f (x) 4 f (y)). These tests have the 
following three properties: 


e They are easy to perform, in time polynomial in the truth table size 2”. 

e A random function f will pass these tests with overwhelming probability (that is, 
such an f will “look like PARITY” in the relevant respects). 

e The results of Linial, Mansour, and Nisan [144] show that any f that passes 
these tests remains nontrivial under most random restrictions, and for that reason, 
cannot be in AC’. 


But now, twisting the knife, Razborov and Rudich point out that any natural 
lower bound proof is self-defeating, in that it yields an efficient algorithm to solve 
some of the same problems that we’d set out to prove were hard. More concretely, 
suppose we have a natural lower bound proof against the circuit class C. Then 
by definition, we also have an efficient algorithm A that, given a random Boolean 
function f : {0,1}" > {0,1}, certifies that f ¢ C with at least 1/n? probability 
over f. But this means that C cannot contain very strong families of pseudorandom 
functions: namely, functions f : {0,1}" — {0,1} that are indistinguishable from 
“truly” random functions, even by algorithms that can examine their entire truth 
tables and use time polynomial in 2”. 

Why not? Because A can never certify f ¢ C if f is a pseudorandom function, 
computable in C. But A certifies f ¢ C with 1/n°” probability over a truly random f. 
Thus, A serves to distinguish random from pseudorandom functions with non- 
negligible*’ bias—so the latter were never really pseudorandom at all. 

To recap, we’ve shown that, if there’s any natural proof that any function is 
not in C, then all Boolean functions computable in C can be distinguished from 
random functions by 2)-time algorithms. That might not sound so impressive, 
since 2°” is a lot of time. But a key observation is that, for most of the circuit 
classes C that we care about, there are families of pseudorandom functions {f,}, on 
n bits that are conjectured to require 2?) time to distinguish from truly random 
functions, where p (n) is as large a polynomial as we like (related to the length of 
the random “seed” s). It follows from results of Naor and Reingold [170] that in 
TC° (constant-depth, polynomial-size threshold circuits), there are functions that 
can’t be distinguished from random functions in 2°”) time, unless the factoring 
and discrete logarithm problems are solvable in O (2”) time for every ¢ > 0. (For 


comparison, the best known algorithms for these problems take roughly one time.) 
Likewise, Banerjee et al. [35] showed that in TC, there are functions that can’t be 
distinguished from random in 2°” time, unless noisy systems of linear equations 
can be solved in O (2) time for every ¢ > 0. 

It’s worth pausing to let the irony sink in. Razborov and Rudich are pointing out 
that, as we showed certain problems (factoring and discrete logarithm) to be harder 
and harder via a natural proof, we’d simultaneously show those same problems to 
be easier and easier! Indeed, any natural proof showing that these problems took 


33%n theoretical computer science, the term non-negligible means lower-bounded by 1/n?. 
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at least t (n) time, would also show that they took at most roughly 2' ~'® time. As 
a result, no natural proof could possibly show these problems take more than half- 
exponential time: that is, time ¢ (m) such that t (t (n)) grows exponentially. 

Here, perhaps, we are finally face-to-face with a central conceptual difficulty of 


the P = NP question: namely, we’re trying to prove that certain functions are hard, 
but the problem of deciding whether a function is hard is itself hard, according to 
the very sorts of conjectures that we’re trying to prove.*4 

Of course, the natural proofs barrier didn’t prevent complexity theorists from 
proving strong lower bounds against AC°. But the result of Linial et al. [144] can be 
interpreted as saying that this is because AC? is not yet powerful enough to express 
pseudorandom functions. When we move just slightly higher, to TC° (constant- 
depth threshold circuits), we do have pseudorandom functions under plausible 
hardness assumptions, and—not at all coincidentally, according to Razborov and 
Rudich—we no longer have strong circuit lower bounds. In that sense, natural proofs 
explains almost precisely why the progress toward proving P 4 NP via circuit 
complexity stalled where it did. The one complication in the story is the AC° [m] 
classes, for which we don’t yet have strong lower bounds (though see Sect. 6.4), 
but also don’t have pseudorandom function candidates. For those classes, it’s still 
possible that natural proofs could succeed. 

As Razborov and Rudich themselves stressed, the take-home message is not 
that we should give up on proving P + NP. In fact, since the beginning of 
complexity theory, we’ve had at least one technique that easily evades the natural 
proofs barrier: namely, diagonalization (the technique used to prove P # EXP; 
see Sect. 6.1)! The reason why diagonalization evades the barrier is that it zeroes 
in on a specific property of the function f being lower-bounded—namely, the fact 
that f is EXP-complete, and thus able to simulate all P machines—and thereby 
avoids the trap of arguing that “f is hard because it looks like a random function.” 
Of course, diagonalization is subject to the relativization barrier (see Sect. 6.1.2), 
so the question still stands of how to evade relativization and natural proofs 
simultaneously; we’ ll return to that question in Sect. 6.3. 

More broadly, there are many cases in mathematics where we can prove that 
some object O of interest to us has a property P, even though we have no hope of 
finding a general polynomial-time algorithm to decide whether any given object has 
property P, or even to certify a large fraction of objects as having property P. In 
such cases, often we prove that O has property P by exploiting special symmetries 
in O—symmetries that have little to do with why O has property P, but everything 
to do with why we can prove it has the property. As an example, a random graph is 
an expander graph (that is, a graph on which a random walk mixes rapidly) with 
overwhelming probability. But since the general problem of deciding whether a 


+4Technically, the problem of distinguishing random from pseudorandom functions is equivalent to 
the problem of inverting one-way functions, which is not quite as strong as solving NP-complete 
problems in polynomial time—only solving average-case NP-complete problems with planted 
solutions. For more see Sect. 5.3. 
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graph is an expander is NP-hard, if we want a specific graph G that’s provably 
an expander, typically we need to construct G with a large amount of symmetry: 
for example, by taking it to be the Cayley graph of a finite group. Similarly, even 
though we expect that there’s no general efficient algorithm to decide if a Boolean 
function f is hard,*> given as input f’s truth table, we might be able to prove that 
certain specific f’s (for example, NP- or #P-complete ones) are hard by exploiting 
their symmetries. Geometric Complexity Theory (see Sect. 6.6) is the best-known 
development of that particular hope for escaping the natural proofs barrier. 

But GCT is not the only way to use symmetry to evade natural proofs. As a 
vastly smaller example, I [2, Appendix 10] proved an exponential lower bound 
on the so-called manifestly orthogonal formula size of a function f : {0,1}" > 
{0,1} that outputs 1 if the input x is a codeword of a linear error-correcting 
code, and 0 otherwise. Here a manifestly orthogonal formula is a formula over 
X1,--+,Xn,X1,---,Xp, consisting of OR and AND gates, where every OR must be 
of two subformulas over the same set of variables, and every AND must be of 
two subformulas over disjoint sets of variables. My lower bound wasn’t especially 
difficult, but what’s notable about it took crucial advantage of a symmetry of linear 
error-correcting codes: namely, the fact that any such code can be recursively 
decomposed as a disjoint union of Cartesian products of smaller linear error- 
correcting codes. My proof thus gives no apparent insight into how to certify that 
a random Boolean function has manifestly orthogonal formula size exp (n), and 
possibly evades the natural proofs barrier (if there is such a barrier in the first place 
for manifestly orthogonal formulas). 

Another proposal for how one could use symmetry to evade the natural proofs 
barrier comes from a beautiful 2010 paper by Allender and Koucky [24] (see also 
Allender’s survey [20]). These authors show that, if one wanted to prove that certain 
specific NC! problems were not in TC°, thereby establishing the breakthrough 
separation TC? 4 NC’, it would suffice to show that those problems had no TC° 
circuits of size n'*®, for any constant ¢ > 0. To achieve this striking “bootstrap,” 
from an n'T* lower bound to a superpolynomial one, Allender and Koucky exploit 
the self-reducibility of the NC! problems in question, the fact that they can be 
reduced to smaller instances of themselves. Crucially, this self-reducibility would 
not hold for a random function. For this reason, the proposed lower bound method 
has at least the potential to evade the natural proofs barrier. Indeed, it’s not even 
totally implausible that a natural proof could yield an n'+* lower bound for TC° 


The problem, given as input the truth table of a Boolean function f : {0,1}" — {0,1}, 
of computing or approximating the circuit complexity of f is called the Minimum Circuit Size 
Problem (MCSP). It is a longstanding open problem whether or not MCSP is NP-hard; at any rate, 
there are major obstructions to proving it NP-hard with existing techniques (see Kabanets and Cai 
[117] and Murray and Williams [169]). On the other hand, MCSP cannot be in P (or BPP) unless 
there are no cryptographically-secure pseudorandom generators. At any rate, what is relevant to 
natural proofs is just whether there is an efficient algorithm to certify a large fraction of Boolean 
functions as being hard, which is a weaker requirement than solving MCSP. 
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circuits, with the bootstrapping from n'** to superpolynomial the only non-natural 
part of the argument.*° 


6.3 Arithmetization 


In the previous sections, we saw that there are logic-based techniques (like 
diagonalization) that suffice to prove P 4 EXP and NEXP“? ¢ P/poly, and that 
indeed evade the natural proofs barrier, but that are blocked from proving P 4 NP 
by the relativization barrier. Meanwhile, there are combinatorial techniques (like 
random restrictions) that suffice to prove circuit lower bounds against AC° and 
AC° [p], and that evade the relativization barrier, but that are blocked from proving 
lower bounds against P/poly (and hence, from proving P 4 NP) by the natural 
proofs barrier. 

This situation raises a question: couldn’t we simply combine techniques that 
evade relativization but not natural proofs, with techniques that evade natural proofs 
but not relativization, in order to evade both? As it turns out, we can. 


6.3.1 IP = PSPACE 


The story starts with a dramatic development in complexity theory around 1990, 
though not one that obviously bore on P # NP or circuit lower bounds. In the 
1980s, theoretical cryptographers became interested in so-called interactive proof 
systems, which are protocols where a computationally-unbounded but untrustworthy 
prover (traditionally named Merlin) tries to convince a skeptical polynomial-time 
verifier (traditionally named Arthur) that some mathematical statement is true, via a 
two-way conversation, in which Arthur can randomly generate challenges and then 
evaluate Merlin’s answers to them. 

More formally, let IP (Interactive Proof) be the class of all languages L © {0, 1}* 
for which there exists a probabilistic polynomial-time algorithm for Arthur with the 
following properties. Arthur receives an input string x € {0, 1}” (which Merlin also 
knows), and then generates up to n?) challenges to send to Merlin. Each challenge 
is a string of up to n°” bits, and each can depend on x, on Arthur’s internal random 
bits, and on Merlin’s responses to the previous challenges. (We also allow Arthur, if 
he likes, to keep some random bits hidden, without sending them to Merlin—though 


3©Allender and Koucky’s paper partly builds on 2003 work by Srinivasan [211], who showed 
that, to prove P # NP, one would “merely” need to show that any algorithm to compute weak 
approximations for the MAXCLIQUE problem takes Q (n't*) time, for some constant ¢ > 0. 
The way Srinivasan proved this striking statement was, again, by using a sort of self-reduciblity: 
he showed that, if there’s a polynomial-time algorithm for MAXCLIQUE, then by running that 
algorithm on smaller graphs sampled from the original graph, one can solve approximate versions 
of MAXCLIQUE in n't? time. 
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surprisingly, this turns out not to make any difference [90].) We think of Merlin as 
trying his best to persuade Arthur that x € L; at the end, Arthur decides whether to 
accept or reject Merlin’s claim. We require that for all inputs x: 


¢ Ifx € L, then there is some strategy for Merlin (i.e., some function determining 
which message to send next, given x and the sequence of challenges so far*’) 
that causes Arthur to accept with probability at least 2/3 over his internal 
randomness. 

¢ If x ¢ L, then regardless of what strategy Merlin uses, Arthur rejects with 
probability at least 2/3 over his internal randomness. 


Clearly IP generalizes NP: indeed, we recover NP if we get rid of the interaction 
and randomness aspects, and just allow a single message from Merlin, which Arthur 
either accepts or rejects. In the other direction, it’s not hard to show that IP C 
PSPACE.*® 

The question asked in the 1980s was: does interaction help? how much bigger 
is IP than NP? It was observed that IP contains at least a few languages that aren’t 
known to be in NP, such as graph non-isomorphism. This is so because of a simple, 
famous, and elegant protocol [89]: given two n-vertex graphs G and H, Arthur can 
pick one of the two uniformly at random, randomly permute its vertices, then send 
the result to Merlin. He then challenges Merlin: which graph did I start from, G or 
H? If G # H, then Merlin, being computationally unbounded, can easily answer 
this challenge by solving graph isomorphism. If, on the other hand, G = H, then 
Merlin sees the same distribution over graphs regardless of whether Arthur started 
from G or H, so he must guess wrongly with probability 1/2. 

Despite such protocols, the feeling in the late 1980s was that IP should be only 
a “slight” extension of NP. This feeling was buttressed by a result of Fortnow and 
Sipser [83], which said that there exists an oracle A such that coNP“ ¢ IP“, and 
hence, any interactive protocol even for coNP (e.g., for proving Boolean formulas 
unsatisfiable) would require non-relativizing techniques. 

Yet in the teeth of that oracle result, Lund et al. [151] showed nevertheless that 
coNP C IP “in the real world”—and not only that, but P#P Cc IP. This was quickly 
improved by Shamir [200] to the following striking statement: 


Theorem 49 ((151, 200]). IP = PSPACE. 


Theorem 49 means, for example, that if a computationally-unbounded alien came 
to Earth, it could not merely beat us in games of strategy like chess—rather, the 


37We can assume without loss of generality that Merlin’s strategy is deterministic, since Merlin is 
computationally unbounded, and any convex combination of strategies must contain a deterministic 
strategy that causes Arthur to accept with at least as great a probability as the convex combination 
does. 

38This is so because a polynomial-space Turing machine can treat the entire interaction between 
Merlin and Arthur as a game, in which Merlin is trying to get Arthur to accept with the largest 
possible probability. The machine can then evaluate the exponentially large game tree using depth- 
first recursion. 
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alien could mathematically prove to us, via a short conversation and to statistical 
certainty, that it knew how to play perfect chess. Theorem 49 has been hugely 
influential in complexity theory for several reasons, but one reason was that it 
illustrated, dramatically and indisputably, that the relativization barrier need not 
inhibit progress. 

So how was this amazing result achieved, and why does the proof fail relative 
to certain oracles? The trick is what we now call arithmetization. This means that 
we take a Boolean formula or circuit—involving, for example, AND, OR, and 
NOT gates—and then reinterpret the Boolean gates as arithmetic operations over 
some larger finite field F,. More concretely, the Boolean AND (x A y) becomes 
multiplication (xy), the Boolean NOT becomes the function 1 — x, and the Boolean 
OR (x Vv y) becomes x + y — xy. Note that if x,y € {0,1}, then we recover the 
original Boolean operations. But the new operations make sense even if x, y ¢ {0, 1}, 
and they have the effect of lifting our Boolean formula or circuit to a multivariate 
polynomial over F,. Furthermore, the degree of the polynomial can be upper- 
bounded in terms of the size of the formula or circuit. 

The advantage of this lifting is that polynomials, at least over large finite fields, 
have powerful error-correcting properties that are unavailable in the Boolean case. 
These properties ultimately derive from the Fundamental Theorem of Algebra: 
a nonzero, degree-d univariate polynomial has at most d roots. As a consequence, if 
q.q : F, — F, are two degree-d polynomials that are unequal (and d < p), then 
with high probability, their inequality can be seen by querying them at a random 
point: 


via 


Pr lg) = @)] < 


Let me now give a brief impression of how one proves Theorem 49, or at least the 
simpler result coNP C IP. Let 9 (41,...,X,) be, say, a 3SAT formula that Merlin 
wants to convince Arthur is unsatisfiable. Then Arthur first lifts g to a multivariate 
polynomial q : F; > F,, of degree d < |g| (where |9| is the size of ¢), over the 
finite field F,, for some p >> 2”. Merlin’s task is equivalent to convincing Arthur of 
the following equation: 


>, q(X1,-+-,X,) = 0. 


X14... €{0,1} 


To achieve this, Merlin first sends Arthur the coefficients of a univariate polynomial 
qi : F, > Fp. Merlin claims that q; satisfies 


gen= >) @@indencm,); (1) 


X2,.-.,Xn €{0, 1} 


and also satisfies g; (0) + q; (1) = 0. Arthur can easily check the latter equation for 
himself. To check Eq. (1), Arthur picks a random value r; € F, for x; and sends it 
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to Merlin. Then Merlin replies with a univariate polynomial q2, for which he claims 
that 


ar(2)= Dg (71.82.83... an) 


X3,...X, €{0, 1} 


Arthur checks that q2 (0) + q2 (1) = qi (1), then picks a random value 72 € F, 
for x2 and sends it to Merlin, and so on. Finally, Arthur checks that g, is indeed the 
univariate polynomial obtained by starting from the arithmetization of ¢, then fixing 
X1,---,Xp-1 tO 11,...,%—1 respectively. The Fundamental Theorem of Algebra 
ensures that, if Merlin lied at any point in the protocol, then with high probability at 
least one of Arthur’s checks will fail. 

Now, to return to the question that interests us: why does this protocol escape 
the relativization barrier? The short answer is: because if the Boolean formula 
gy involved oracle gates, then we wouldn’t have been able to arithmetize g. By 
arithmetizing gy, we did something “deeper” with it, more dependent on its structure, 
than simply evaluating g on various Boolean inputs (which would have continued 
to work fine had an oracle been involved). Arithmetization made sense because @ 
was built out of AND and OR and NOT gates, which we were able to reinterpret 
arithmetically. But how would we arithmetically reinterpret an oracle gate? 


6.3.2 Hybrid Circuit Lower Bounds 


To recap, PSPACE C IP is a non-relativizing inclusion of complexity classes. But 
can we leverage that achievement to prove non-relativizing separations between 
complexity classes, with an eye toward P # NP? Certainly, by combining IP = 
PSPACE with the Space Hierarchy Theorem (which implies SPACE (n*) x 
PSPACE for every fixed k), we get that IP ¢@ SPACE(n‘) for every fixed k. 
Likewise, by combining IP = PSPACE with Theorem 38 (that PSPACE does not 
have circuits of size n‘ for fixed k), we get that IP doesn’t have circuits of size n‘ 
either. Furthermore, both of these separations can be shown to be non-relativizing, 
using techniques from [56]. But can we get more interesting separations? 

The key to doing so turns out to be a beautiful corollary of the IP = PSPACE 
theorem. To state the corollary, we need one more complexity class: MA (Merlin- 
Arthur) is a probabilistic generalization of NP. It’s defined as the class of languages 
L C {0, 1}* for which there exists a probabilistic polynomial-time verifier M, and a 
polynomial p, such that for all inputs x € {0, 1}*: 


* Ifx € L then there exists a witness string w € {0, 1}”") such that M (x, w) 
accepts with probability at least 2/3 over its internal randomness. 

¢ Ifx ¢ L, then M(x,w) rejects with probability at least 2/3 over its internal 
randomness, for all w. 
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Clearly MA contains NP and BPP. It can also be shown that MA c ©5115 
and that MA C PP, where PP is the counting class from Sect. 2.2.6. Now, here’s 
the corollary of Theorem 49: 


Corollary 50. If PSPACE ¢ P/poly, then PSPACE = MA. 


Proof. Suppose PSPACE c P/poly, let L € PSPACE, and let x be an input in L. 
Then as an MA witness proving that x € L, Merlin simply sends Arthur a description 
of a polynomial-size circuit C that simulates the PSPACE prover, in an interactive 
protocol that convinces Arthur that x € L. (Here we use one additional fact about 
Theorem 49, beyond the mere fact that IP = PSPACE: that, in the protocol, 
Merlin can run a PSPACE algorithm to decide which message to send next.) Then 
Arthur simulates the protocol, using C to compute Merlin’s responses to his random 
challenges, and accepts if and only if the protocol does. Hence L € MA. a 


Likewise: 
Corollary 51. If P#P Cc P/poly, then p#P — MA. 


(Again, here we use the observation that, in the protocol proving that P*? C IP, 
Merlin can run a P*? algorithm to decide which message to send next.) 

Let’s now see how we can use these corollaries of IP = PSPACE to prove new 
circuit lower bounds. Let MAexp be “the exponential-time version of MA,” with a 
2?\)_size witness that can be probabilistically verified in 2?” time: in other words, 
the class that is to MA as NEXP is to NP. Then: 


Theorem 52 (Buhrman-Fortnow-Thierauf [56]). MAgxp ¢ P/poly. 


Proof. Suppose by contradiction that MAgxp C P/poly. Then certainly PSPACE C 
P/poly, which means that PSPACE = MA by Corollary 50. By a padding argument 
(see Proposition 17), this means that EXPSPACE = MAexp. But we already saw in 
Theorem 37 that EXPSPACE ¢ P/poly, and therefore MAgxp ¢ P/poly as well. 


Note in particular that if we could prove MA = NP, then we would also have 
MAexp = NEXP by padding, and hence NEXP ¢ P/poly by Theorem 52. This 
provides another example of how derandomization can lead to circuit lower bounds, 
a theme mentioned in Sect. 6. 

A second example involves the class PP. 


Theorem 53 (Vinodchandran [231]). For every fixed k, there is a language in PP 


that does not have circuits of size n*. 


Proof. Fix k, and suppose by contradiction that PP has circuits of size n*. Then 
in particular, PP C P/poly, so PPP = P#P C P/poly, so P#? = PP = MA by 


Corollary 51. But we noted in Sect. 6.1 that pI does not have circuits of size n*. 
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And a c p#P by Toda’s Theorem (Theorem 13), so P#P doesn’t have circuits of 
size n* either. Therefore neither does PP.*? a 


As a final example, Santhanam [195] showed the following (we omit the proof). 


Theorem 54 (Santhanam [195]). For every fixed k, there is an MA “promise 
problem” that does not have circuits of size n*. 


The above results clearly evade the natural proofs barrier, because they give 
lower bounds against strong circuit classes such as P/poly, or the set of all size- 
n* circuits for fixed k. This is not so surprising when we observe that the proofs 
build on the simpler results from Sect. 6.1, which already used diagonalization to 
evade the natural proofs barrier. 

What is more interesting is that these results also evade the relativization barrier. 
Of course, one might guess as much, after noticing that the proofs use the non- 
relativizing IP = PSPACE theorem. But to show rigorously that the circuit lower 
bounds themselves fail to relativize, one needs to construct oracles relative to which 
the circuit lower bounds are false. This is done by the following results, whose 
somewhat elaborate proofs we omit: 


Theorem 55 (Buhrman-Fortnow-Thierauf [56]).. There exists an oracle A such 
that MAgxp © P4/poly. 


Theorem 56 (Aaronson [4]). There exists an oracle A relative to which all lan- 
guages in PP have linear-sized circuits. 


The proofs of both of these results also easily imply that there exists an oracle 
relative to which all MA promise problems have linear-sized circuits. 

The bottom line is that, by combining non-relativizing results like IP = PSPACE 
with non-naturalizing results like EXPSPACE ¢ P/poly, we can prove interesting 
circuit lower bounds that neither relativize nor naturalize. So then why couldn’t we 
keep going, and use similar techniques to prove NEXP ¢ P/poly, or even P 4 NP? 
Is there a third barrier, to which even the arithmetization-based lower bounds are 
subject? 


39 Actually, for this proof one does not really need either Toda’s Theorem, or the slightly-nontrivial 
result that D° does not have circuits of size n*. Instead, one can just argue directly that at any 
rate, P*P does not have circuits of size n‘, using a slightly more careful version of the argument of 


Theorem 37. For details see Aaronson [4]. 


40In complexity theory, a promise problem is a pair of subsets Ilys, no € {0, 1}* with Ilygs N 
IIno = ©. An algorithm solves the problem if it accepts all inputs in Tygs and rejects all inputs 
in IIno. Its behavior on inputs neither in ITygs nor Ino (..e., inputs that “violate the promise’’) can 
be arbitrary. A typical example of a promise problem is: given a Boolean circuit C, decide whether 
C accepts at least 2/3 of all inputs x € {0, 1}” or at most 1/3 of them, promised that one of those 
is true. This problem is in BPP (or technically, PromiseBPP). The role of the promise here is to 
get rid of those inputs for which random sampling would accept with probability between 1/3 and 
2/3, violating the definition of BPP. 


66 S. Aaronson 


6.3.3 The Algebrization Barrier 


In 2008, Avi Wigderson and I [10] showed that, alas, there is a third barrier. 
In particular, while the arithmetic techniques used to prove IP = PSPACE do 
evade relativization, they crash up against a modified version of relativization 
that is “wise” to those techniques. We called this modified barrier the algebraic 
relativization or algebrization barrier. We then showed that, in order to prove 
P # NP—or for that matter, even to prove NEXP ¢ P/poly, or otherwise go even 
slightly beyond the results of Sect. 6.3.2—one would need techniques that evade the 
algebrization barrier (and also, of course, evade natural proofs). 

In more detail, we can think of an oracle as just an infinite collection of Boolean 
functions, f, : {0, 1}” — {0,1} for each n. Now, by an algebraic oracle, we mean 
an oracle that provides access not only to f,, for each n, but also to a low-degree 
extension fy, : F’ > F of f, over some large finite field F. This extension must have 
the property that fy (x) = f, (x) for all x € {0, 1}", and it must be a polynomial of 
low degree—say, at most 2n. But such extensions always exist, and querying them 
outside the Boolean cube {0, 1}” might help even for learning about the Boolean 
part fr. 

The point of algebraic oracles is that they capture what we could do if we had a 
formula or circuit for f,,, and were willing to evaluate it not only on Boolean inputs, 
but on non-Boolean ones as well, in the manner of IP = PSPACE. In particular, we 
saw in Sect. 6.3.1 that, given (say) a 3SAT formula ¢, we can “lift” @ to a low-degree 
polynomial ¢ over a finite field F by reinterpreting the AND, OR, and NOT gates 
in terms of field addition and multiplication. So if we’re trying to capture to power 
of arithmetization relative to an oracle function f,, then it stands to reason that we 
should also be allowed to lift f,. 

Once we do so, we find that the non-relativizing results based on arithmetization, 
such as IP = PSPACE, relativize with respect to algebraic oracles (or “algebrize’). 
That is: 


Theorem 57. Ip* = PSPACE® for all algebraic oracles A. Likewise, PSPAC Et G 


P4/poly implies PSPACE* = MA* for all algebraic oracles ‘A, and so on for all 
the interactive proof results. 


The intuitive reason is that, any time (say) Arthur needs to arithmetize a formula 
gy containing A-oracle gates in an interactive protocol, he can handle non-Boolean 
inputs to the A-oracle gates by calling A. 

As a consequence of Theorem 57, the circuit lower bounds of Sect. 6.3.2 are 
algebrizing as well: for example, for all algebraic oracles ‘A, we have MAgExp ¢ 


P4/poly, and PP“ does not have size-n‘ circuits with ‘A-oracle gates. 


Admittedly, the original paper of Aaronson and Wigderson [10] only managed 
to prove a weaker version of Theorem 57. It showed, for example, that for all 
algebraic oracles A, we have PSPACE’ C IP4, and MAgyp Z P4/poly. As a 
result, it had to define algebrization in a convoluted way, where some complexity 
classes received the algebraic oracle A while others received only the “original” 
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oracle A, and which received which depended on what kind of result one was 
talking about (e.g., an inclusion or a separation). Shortly afterward, Impagliazzo, 
Kabanets, and Kolokolova [112] managed to fix this defect of algebrization, 
proving Theorem 57 even when all classes receive the same algebraic oracle A, 
but only at the cost of jettisoning Aaronson and Wigderson’s conclusion that any 
proof of NEXP ¢ P/poly will require non-algebrizing techniques. Very recently, 
Aydinhoglu and Bach [174] showed how to get the best of both worlds, with a 
uniform definition of algebrization and the conclusion about NEXP vs. P/poly. 

In any case, the main point of [10] was that to prove P # NP, or otherwise go 
further than the circuit lower bounds of Sect. 6.3.2, one will need non-algebrizing 
techniques: techniques that fail to relativize in a “deeper” way than IP = PSPACE 
fails to relativize. Let us see why this is true for P ~ NP. 


Theorem 58 (Aaronson-Wigderson [10]). There exists an algebraic oracle A 


such that PA = NP“. As a consequence, any proof of P # NP will require non- 
algebrizing techniques. 


Proof. We can just let A be any PSPACE-complete language, and then let A be 
its unique extension to a collection of multilinear polynomials over F (that is, 
polynomials in which no variable is ever raised to a higher power than 1). The 
key observation is that the multilinear extensions are themselves computable in 
PSPACE. So we get a PSPACE-complete oracle A, which collapses P and NP for 
the same reason as in the original argument of Baker et al. [34] (see Theorem 40). 
B 


Likewise, Aaronson and Wigderson [10] showed that any proof of P = NP, 
or even P = BPP, would need non-algebrizing techniques. They also proved the 
following somewhat harder result, whose proof we omit. 


Theorem 59 ((10]). There exists an algebraic oracle A such that NEXP4 c 
P4/poly. As a consequence, any proof of NEXP ¢ P/poly will require non- 
algebrizing techniques. 


Note that this explains almost exactly why progress stopped where it did: 
MAexp £ P/poly can be proved with algebrizing techniques, but NEXP ¢ P/poly 
cannot be. 

I should mention that Impagliazzo et al. [112] gave a logical interpretation of 
algebrization, extending the logical interpretation of relativization given by Arora et 
al. [28]. In particular, Impagliazzo et al. show that the algebrizing statements can be 
seen as all those statements that follow from “algebrizing axioms for computation,” 
which include basic closure properties, and also the ability to lift any Boolean 
computation to a larger finite field. Statements like P # NP are then provably 
independent of the algebrizing axioms. 


68 S. Aaronson 
6.4 Ironic Complexity Theory 


There is one technique that has had some striking recent successes in proving circuit 
lower bounds, and that bypasses the natural proofs, relativization, and algebrization 
barriers. This technique might be called “ironic complexity theory.” It uses the 
existence of surprising algorithms in one setting to show the nonexistence of 
algorithms in another setting. It thus reveals a “duality” between upper and lower 
bounds, and reduces the problem of proving impossibility theorems to the much 
better-understood task of designing efficient algorithms.*! 

At a conceptual level, it is not hard to see how algorithms can lead to lower 
bounds. For example, suppose someone discovered a way to verify arbitrary 
exponential-time computations efficiently, thereby proving NP = EXP. Then as 
an immediate consequence of the Time Hierarchy Theorem (P 4 EXP), we would 
get P ~ NP. As another example, suppose someone discovered that every language 
in P had linear-size circuits. Then P = NP would imply that every language in PH 
had linear-size circuits—but since we know that is not the case (see Sect. 6.1), we 
could again conclude that P # NP. Conversely, if someone proved P = NP, that 
wouldn’t be a total disaster for lower bounds research: at least it would immediately 
imply EXP ¢ P/poly (via EXP = EXPNP), and the existence of languages of P 
and NP that don’t have linear-size circuits! 

Examples like this can be multiplied, but there is an obvious problem with 
them: they each show a separation, but only assuming a collapse that is considered 
extremely unlikely to happen. However, recently researchers have managed to use 
surprising algorithms that do exist, and collapses that do happen, to achieve new 
lower bounds. In this section I'll give two examples. 


6.4.1 Time-Space Tradeoffs 


At the moment, no one can prove that solving 3SAT requires more than linear time 
(let alone exponential time!), on realistic models of computation like random-access 
machines.” Nor can anyone prove that solving 3SAT requires more than O (log n) 
bits of memory. But the situation is not completely hopeless: at least we can prove 
there’s no algorithm for 3SAT that uses both linear time and logarithmic memory! 
Indeed, we can do better than that. 

A bit of background: just as one can scale PSPACE up to EXPSPACE and 
so on, one can also scale PSPACE down to LOGSPACE, which the class of 
languages L decidable by a Turing machine that uses only O (log n) bits of read/write 


“\Indeed, the hybrid circuit lower bounds of Sect. 6.3.2 could already be considered examples of 
ironic complexity theory. In this section, we discuss other examples. 

“On unrealistic models such as one-tape Turing machines, one can prove up to Q (n?) lower 
bounds for 3SAT and many other problems (even recognizing palindromes), but only by exploiting 
the fact that the tape head needs to waste a lot of time moving back and forth across the input. 
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memory, in addition to a read-only memory that stores the n-bit input itself. We 
have LOGSPACE C P, for the same simple reason why PSPACE © EXP (see 
Proposition 15). We also have LOGSPACE #4 PSPACE by the Space Hierarchy 
Theorem. On the other hand, no one has proven even that LOGSPACE # NP. 

Now, a “time-space tradeoff theorem” shows that any algorithm to solve some 
problem must use either more than T time or else more than S' space. Let me state 
perhaps the canonical time-space tradeoff theorem for 3S AT (though it’s since been 
improved): 


Theorem 60 (Lipton-Viglas [149]).. No random-access machine can solve 3SAT 
simultaneously in n¥?-* time and n°) space, for any € > 0. 


Here, a “random-access machine” means a machine that can access an arbitrary 
memory location in O(1) time, as usual in practical programming. This makes 
Theorem 60 stronger than one might have assumed: it holds not merely for 
unrealistically weak models such as Turing machines, but for realistic models as 
well. Also, again, “n°) space” means that we get the n-bit 3SAT instance itself in a 
read-only memory, and also get n°“! bits of auxiliary read/write memory. 

While Theorem 60 is obviously a far cry from P 4 NP, it does rely essentially 
on 3SAT being NP-complete: we don’t yet know how to prove analogous results 
for matching, linear programming, or other natural problems in P.*? This makes 
Theorem 60 fundamentally different from (say) the PARITY ¢ AC° result of 
Sect. 6.2.3. 

Let DTISP (7, S) be the class of languages decidable by an algorithm, running 
on a RAM machine, that uses O (T) time and O (S) space. Then Theorem 60 can be 
stated more succinctly as 


3Sat ¢ DTISP (ae ni?) 


for alle > 0. 

At a high level, Theorem 60 is proved by assuming the opposite, and then deriv- 
ing stranger and stranger consequences until one ultimately gets a contradiction with 
the Nondeterministic Time Hierarchy Theorem (Theorem 35). There are three main 
ideas that go into how one does this. The first idea is a tight version of the Cook- 
Levin Theorem (Theorem 2). In particular, one can show, not merely that 3SAT is 
NP-complete, but that 3SAT is complete for NTIME (7) (that is, nondeterministic 


3 On the other hand, by proving size-depth tradeoffs for so-called branching programs, researchers 
have been able to obtain time-space tradeoffs for certain special problems in P. Unlike the 3SAT 
tradeoffs, the branching program tradeoffs involve only slightly superlinear time bounds; on the 
other hand, they really do represent a fundamentally different way to prove time-space tradeoffs, 
one that makes no appeal to NP-completeness, diagonalization, or hierarchy theorems. As one 
example, in 2000 Beame et al. [37], building on earlier work by Ajtai [17], used branching 
programs to prove the following: there exists a problem in P, based on binary quadratic forms, 
for which any RAM algorithm (even a nonuniform one) that uses n!~°“ space must also use 


Q (n- flog n/log log n) time. 
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linear-time on a RAM machine) under nearly linear-time reductions. That means 
that, to prove Theorem 60, it suffices to prove a non-containment of complexity 
classes: 


NTIME (n) ¢ DTISP (aia) 


for all e > 0. 

The second idea is called “trading time for alternations.” Consider a deterministic 
computation that runs for T steps and uses S bits of memory. Then we can “chop the 
computation up” into k blocks, B,,..., Bx, of T/k steps each. The statement that the 
computation accepts is then equivalent to the statement that there exist S-bit strings 
X0,...,Xz, Such that 


(i) xo is the computation’s initial state, 
(ii) for alli € {1,...,k}, the result of starting in state x;-; and then running for 
T/k steps is x;, and 
(iii) x; 1s an accepting state. 


We can summarize this as 
T 
DTISP (7, S) © S2TIME (sx-+ a) ; 


where the Xz means that we have two alternating quantifiers: an existential 
quantifier over x;,...,x,, followed by a universal quantifier over i. Choosing k := 
J T/S to optimize the bound then gives us 


DTISP (7, S) ¢ Z»TIME (7S) , 
So in particular, 
DTISP (n°, 2) C Yo TIME (een) 


The third idea is called “trading alternations for time.” If we assume by way of 
contradiction that 


NTIME (n) ¢ DTISP (Ge n) C TIME (n°), 
then in particular, for all b > 1, we can remove the inner quantifier to get 
D2 TIME (n”) C NTIME (n”) . 
So putting everything together, if we consider a constant c > 1, and use padding 


(as in Proposition 17) to talk about NTIME (n’) rather than NTIME (n), then the 
starting assumption that 3SAT is solvable in n°~* time and n°“ space implies that 


NTIME (n*) < DTISP (n,n) 
S Ey TIME (n°) 


C NTIME (ee) 


But if c? < 2, then this contradicts the Nondeterministic Time Hierarchy Theorem 
(Theorem 35). This completes the proof of Theorem 60. Notice that the starting 
hypothesis about 3SAT was applied not once but twice, which was how the final 
running time became n°. 

Later, using a more involved argument, Fortnow and van Melkebeek [81] 
improved Theorem 60, to show that 3SAT can’t be solved by a RAM machine 
14/5 

2 


using n?~* time and n?” space, where ¢ = ~ 1.618 is the golden ratio. 


Subsequently Williams [234] improved the time bound in this result to n¥7*~®, 
and then [235] to n?°°8*/7—*, with the help of computer search for the optimal 
sequence of “moves” in an alternation-trading argument. In 2012, however, Buss 
and Williams [63] showed that no alternation-trading proof can possibly improve 
that exponent beyond the peculiar constant 2cos/7 ~ 1.801. There have been 
many related time-space tradeoff results, including for #P-complete and PSPACE- 
complete problems, but I won’t cover them here (see van Melkebeek [152] for a 
survey). 

Alternation-trading has had other applications in complexity theory, other than 
to time-space tradeoffs. In particular, it played a key role in a celebrated 1983 result 
of Paul et al. [177], whose statement is tantalizingly similar to P 4 NP. 


Theorem 61 (Paul et al. [177]). TIME(2) #4 NTIME(n), if we define these 


classes using multiple-tape Turing machines. 


In this case, the key step was to show, via a clever combinatorial argument 
involving “pebble games,” that for multi-tape Turing machines, deterministic linear 
time can be simulated in X4TIME (f (n)), for some f that’s slightly sublinear. This, 
combined with the assumption TIME (n) = NTIME (n), is then enough to produce 
a contradiction with a time hierarchy theorem. 

What can we say about barriers? All the results mentioned above clearly evade 
the natural proofs barrier, because they ultimately rely on diagonalization, and 


(more to the point) because classes like TIME (n) and DTISP nv?, n°) ) contain 


plausible pseudorandom function candidates. Whether they evade the relativization 
barrier (let alone algebrization) is a trickier question; it depends on subtle details of 
the oracle access mechanism. There are some definitions of the classes TIME (n)*, 
DTISP (T, Si, and so on under which these results relativize, and others under 
which they don’t: for details, see for example Moran [156]. 

On the definitions that cause these results not to relativize, the explanation for 
how is that the proofs “look inside” the operations of a RAM machine or a multi- 
tape Turing machine just enough for something to break down if certain kinds 
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of oracle calls are present. To illustrate, in the proof of Theorem 60 above, we 
nondeterministically guessed the complete state of the machine at various steps in 
its execution, taking advantage of the fact that the state was an n°)-bit string. This 
wouldn’t have worked had there been an n-bit query written onto an oracle tape 
(even if the oracle tape were write-only). Likewise, in the proof of Theorem 61, 
the combinatorial pebble arguments use specific properties of multi-tape Turing 
machines that might fail for RAM machines, let alone for oracle machines. 
Because their reasons for failing to relativize have nothing to do with lifting 
to large finite fields, I conjecture that, with a suitable oracle access mechanism, 
Theorems 60 and 61 would also be non-algebrizing. But this remains to be shown. 


6.4.2 NEXP ¢ ACC 


In Sect. 6.2.4, we saw how Smolensky [209] and Razborov [189] managed to prove 
strong lower bounds against the class AC” [p], or constant-depth, polynomial-size 
circuits of AND, OR, NOT, and MOD p gates, where p is a prime. This left the 
frontier of circuit lower bounds as AC° [m], where m is a composite. Slightly more 
ambitiously, we could hope for lower bounds against a complexity class called 
ACC, which consists of all languages decidable by constant-depth, polynomial-size 
circuits with AND, OR, NOT, and MOD m gates for any m (where we can mix 
multiple m’s in the same circuit). 

Meanwhile, we saw in Sect. 6.3 how Buhrman et al. [56] proved that MAgexp ¢ 
P/poly, but how this cannot be extended even to NEXP ¢ P/poly using algebrizing 
techniques. Indeed, it remains open even to prove NEXP ¢ TC’. 

This state of affairs—and its continuation for decades—helps to explain why 
many theoretical computer scientists were electrified when Ryan Williams proved 
the following in 2011. 


Theorem 62 (Williams [242]). NEXP ¢ ACC. 


If we compare it against the ultimate goal of proving NP ¢ P/poly, Theorem 62 
looks almost laughably weak: it shows only that Nondeterministic Exponential 
Time, a class vastly larger than NP, is not in ACC, a circuit class vastly smaller 
than P/poly. But a better comparison is against where we were before. The proof 
of Theorem 62 was noteworthy not only because it defeats all the known barriers 
(relativization, algebrization, and natural proofs), but also because it brings together 
almost all known techniques in Boolean circuit lower bounds, including diagonal- 
ization, the polynomial method, interactive proof results, and ironic complexity 
theory. So it is worth at least sketching the elaborate proof, so we can see how a 
lower bound at the current frontier operates. (For further details, I recommend two 
excellent expository articles by Williams himself [236, 240].) 

At a stratospherically high level, the proof of Theorem 62 is built around the 
Nondeterministic Time Hierarchy Theorem, following a program that Williams had 
previously laid out in [238]. More concretely, we assume that NTIME (2”) c ACC. 
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We then use that assumption to show that NTIME (2”) = NTIME (2” / nt) for some 
positive k: a slight speedup of nondeterministic machines, but enough to achieve a 
contradiction with Theorem 35. 

How do we use the assumption NTIME (2) c ACC to violate the Nondeter- 
ministic Time Hierarchy Theorem? The key to this—and this is where “ironic 
complexity theory” enters the story—is a faster-than-brute-force algorithm for a 
problem called ACCSAT. Here we are given as input a description of ACC circuit 
C, and want to decide whether there exists an input x € {0, 1%” such that C (x) = 1. 
The core of Williams’s proof is the following straightforwardly algorithmic result. 


Lemma 63 (Williams [242]). There is a deterministic algorithm that solves ACC- 
SAT, for ACC circuits of depth d with n inputs, in 2"-2(”") time, for some constant 
6 > 0 that depends on d. 


The proof of Lemma 63 is itself a combination of several ideas. First, one appeals 
to a powerful structural result of Yao [246], Beigel-Tarui [38], and Allender-Gore 
[23] from the 1990s, which shows that functions in ACC are representable in terms 
of low-degree polynomials. 


Lemma 64 ([23, 38, 246]). Let f : {0,1}” — {0,1} be computable by an ACC 
circuit of size s and depth d. Then f (x) can be expressed as g (p(x)), where p : 


{0,1}" — N is a polynomial of degree log?) OM) 5 


s that is a sum of exp (log 
monomials with coefficients of 1, and g : N > {0, 1} is some efficiently computable 


function. (Here the constant in the big-O depends on d.) 


The proof of Lemma 64 uses some elementary number theory, and is closely 
related to the polynomial method from Sect. 6.2.4, by which one shows that any 
AC° [p] function can be approximated by a low-degree polynomial over the finite 
field F,.4 

Next, one devises a faster-than-brute-force algorithm that, given a function 
g (p (x)) as above, decides whether there exists anx € {0, 1}” such that g (p (x)) = 1. 
The first step is to give an algorithm that constructs a table of all 2” values of p (x), 
for all the 2” possible values of x, in (2" + s°\)) n° time, rather than the O (2”s) 
time that one would need naively. (In other words, this algorithm uses only n?) 
time on average per entry in the table, rather than O (s) time—an improvement if s 
is superpolynomial.) Here there are several ways to go: one can use a fast rectangular 
matrix multiplication algorithm due to Coppersmith [69], but one can also just use 
a dynamic programming algorithm reminiscent of the Fast Fourier Transform. 

Now, by combining this table-constructing algorithm with Lemma 64, we can 
immediately solve ACCSAT, for an ACC circuit of size 5 = Qn) in 2"n° time, 
which is better than the O (2”s) time that we would need naively. However, this still 


isn’t good enough to prove Lemma 63, which demands a gn-2(n') algorithm. So 


“4Interestingly, both the polynomial method and the proof of Lemma 64 are also closely related to 
the proof of Toda’s Theorem (Theorem 13), that PH C pH | 
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ore is a further trick: given an ACC circuit C of size n°, one first “shaves off” 
n° of the n variables, building a new ACC circuit C’ that takes as input the n — n° 
eee variables, and that computes the OR over all Qn possible assignments to 
the n® shaved variables.*> The new circuit C’ has size 2°(” ), sO one can construct 
the table, and thereby solve ACCSar for C’ (and hence for C), in time gran") 

Given Lemma 63, as well as the starting assumption NTIME (2”) Cc ACC, there 
is still a lot of work to do to prove that NTIME (2”) = NTIME (2"/n*). Let me 
summarize the four main steps: 


(1) One first uses a careful, quantitative version of the Cook-Levin Theorem 
(Theorem 2), to reduce the problem of simulating an NTIME (2”) machine to a 
problem called SUCCINCT3SAT. In that problem, one is given a circuit C whose 
truth table encodes an exponentially large 3SAT instance @, and the problem is 
to decide whether or not ¢ is satisfiable. 

One next appeals to a result of Impagliazzo, Kabanets, and Wigderson [113], 
which says that if NEXP C P/poly, then the satisfying assignments for satisfi- 
able SUCCINCT3SAT instances can themselves be constructed by polynomial- 
size circuits. 

One massages the result (2) to get a conclusion about ACC: very roughly 
speaking, if NEXP Cc ACC, then given any satisfiable SUCCINCT3SAT instance 
®, there is an equivalent instance ® in which the circuit C is an ACC circuit. 
Furthermore, a satisfying assignment for ®’ can itself be constructed by an 
ACC circuit W; and the problems of verifying that ® and ©’ are equivalent, 
and that W does indeed encode a satisfying assignment for ®’, can be solved in 
slightly less than 2” time nondeterministically, if we use the fact (Lemma 63) 
that ACCSAT is solvable in 2"-2") time. Note that in this argument (which is 
the most complicated part of the proof), one uses the assumption NEXP c ACC 
not just once but several times. 

Putting everything together, we get that NTIME (2”) machines can be reduced 
to SUCCINCT3SAT instances, which can then (assuming NEXP c ACC, and 
using the ACCSAT algorithm) be decided in NTIME (2” /n*) for some pos- 
itive k. But that contradicts the Nondeterministic Time Hierarchy Theorem 
(Theorem 35). 
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Let me mention some improvements and variants of Theorem 62. Already in 
his original paper [242], Williams noted that the proof actually yields a stronger 
result, that NTIME (2”) has no ACC circuits of “third-exponential” size: that is, 
size f (n) where f (f (f (”))) grows exponentially. He also gave a second result, 
that TIME (2”) XP that is, deterministic exponential time with an NP oracle—has 
no ACC circuits of size 2”. More recently, Williams has extended Theorem 62 
to show that NTIME (2”) /1 1 coNTIME (2”) /1 (where the /1 denotes 1 bit of 


“Curiously, this step can only be applied to the ACC circuits themselves, which of course allow 
OR gates. It cannot be applied to the Boolean functions of low-degree polynomials that one derives 
from the ACC circuits. 
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nonuniform advice) does not have ACC circuits of size n'°£” [239], and also to show 
that even ACC circuits of size n'°2” with threshold gates at the bottom layer cannot 
compute all languages in NEXP [241]. 

At this point, it’s appropriate to make some general remarks about the proof of 
Theorem 62 and the prospects for pushing it further. First of all, why did this proof 
only yield lower bounds for functions in the huge complexity class NEXP, rather 
than EXP or NP or even P? The short answer is that, in order to prove that a class C 
is not in ACC via this approach, we need to use the assumption C C ACC to violate a 
hierarchy theorem for C-like classes. However, there’s a bootstrapping problem: the 
mere fact that C has small ACC circuits doesn’t imply that we can find those circuits 
in a C-like class, in order to obtain the desired contradiction. When C = NEXP, we 
can use the nondeterministic guessing power of the NTIME classes simply to guess 
the small ACC circuits for NEXP, but even when C = EXP this approach seems to 
break down. 

A second question is: what in Williams’s proof was specific to ACC? Here the 
answer is that the proof used special properties of ACC in one place only: namely, 
in the improved algorithm for ACCSAT (Lemma 63). This immediately suggests 
a possible program to prove NEXP ¢ C for larger and larger circuit classes C. 
For example, let TC°SAT be the problem where we are given as input a TC° circuit 
C (that is, a neural network, or constant-depth circuit of threshold gates), and we 
want to decide whether there exists an x € {0, 1}” such that C (x) = 1. Then if we 
could solve TC°SAT even slightly faster than brute force—say, in O (2” / nt) time 
for some positive kK—Williams’s results would immediately imply NEXP ¢ TC°.*° 
Likewise, recall from Sect.2.1 that CIRCUITSAT is the satisfiability problem for 
arbitrary Boolean circuits. If we had an O (2”/n‘) algorithm for CIRCUITSAT, then 
Williams’s results would imply the long-sought NEXP ¢ P/poly. 

A third question is: how does the proof of Theorem 62 evade the known barriers? 
Because of the way the algorithm for ACCSAT exploits the structure of ACC circuits, 
we shouldn’t be surprised if the proof evades the relativization and algebrization 
barriers. And indeed, using the techniques of Wilson [243] and of Aaronson and 
Wigderson [10], one can easily construct an oracle A such that NEXP4 c ACC4, 


and even an algebraic oracle A such that NEXP4 c ACC‘, thereby showing 
that NEXP ¢ ACC is a non-relativizing and non-algebrizing result. Meanwhile, 
because it uses diagonalization (in the form of the Nondeterministic Time Hierarchy 
Theorem), we might say that the proof of Theorem 62 has the “capacity” to evade 
natural proofs. On the other hand, as we alluded to in Sect. 6.2.5, it’s not yet clear 
whether ACC is powerful enough to compute pseudorandom functions—and thus, 
whether it even has a natural proofs barrier to evade! The most we can say is that if 
ACC has a natural proofs barrier, then Theorem 62 evades it. 


4©Very recently, Kane and Williams [119] managed to give an explicit Boolean function that 
requires depth-2 threshold circuits with Q (n/ 2/ log? n) gates. However, their argument does not 
proceed via a better-than-brute-force algorithm for depth-2 TC°SAT; the latter problem appears to 
remain open. 
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Given everything we saw in the previous sections, a final question arises: is there 
some fourth barrier, beyond relativization, algebrization, and natural proofs, which 
will inherently prevent even Williams’s techniques from proving P 4 NP, or even 
(say) NEXP ¢ TC°? One reasonable answer is that this question is premature: in 
order to identify the barriers to a given set of techniques, we first need to know 
formally what the techniques are—i.e., what properties all the theorems using those 
techniques have in common—but we can’t know that until the techniques have had 
a decade or more to solidify, and there are at least three or four successful examples 
of their use. Another answer is that yes, there is (or might be) an obvious “barrier” 
to the continuation of Williams’s program. This barrier is that a faster-than-brute- 
force algorithm for TC°SAT, let alone for more general problems like CIRCUITSAT, 
might simply not exist. If there’s some threshold of expressive power in a circuit, 
beyond which brute-force search really does become the fastest possible algorithm 
for circuit satisfiability, then ironic complexity theory (or at least this incarnation of 
it) will run out of the irony that it needs as fuel. 

Even if so, though, I see Theorem 62 as having contributed something important 
to the quest to prove P # NP, by demonstrating just how much nontrivial work 
can get done, and how many barriers can be overcome, along the way to applying a 
1960s-style hierarchy theorem. Williams’s result makes it possible to imagine that, 
in the far future, P 4 NP might be proved by assuming the opposite, then deriving 
stranger and stranger consequences using thousands of pages of mathematics barely 
comprehensible to anyone alive today—and yet still, the coup de grace will be a 
diagonalization argument, barely different from what Turing did in 1936. 


6.5 Arithmetic Complexity Theory 


Besides Turing machines and Boolean circuits acting on bits, there’s another kind of 
computation that has enormous relevance to the attempt to prove P 4 NP. Namely, 
we can consider computer programs that operate directly on elements of a field, 
such as the reals or complexity. Perhaps the easiest way to do this is via arithmetic 
circuits, which take as input a collection of elements x;,...,x, of a field F,*’ and 
whose operations consist of adding or multiplying any two previous elements— 
or any previous element and any scalar from F—to produce a new F-element. 
We then consider the minimum number of operations needed to compute some 
polynomial g : F” — F, as a function of n. For concreteness, one can think of 
* as the reals R, although we are most interested in algorithms that work over any 
*, and that compute g as a formal polynomial, rather than as just a function over a 
particular F.** 


471°1] restrict to fields here for simplicity, but one can also consider (e.g.) rings. 


48To clarify, 0 and 2x are equal as functions over the finite field F2, but not equal as formal 
polynomials. 
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At first glance, arithmetic circuits seem more powerful than Boolean circuits, 
because they have no limit of finite precision: for example, an arithmetic circuit 
could multiply mz and e in a single time step. From another angle, however, 
arithmetic circuits are weaker, because they have no facility (for example) to extract 
individual bits from the binary representations of the F elements: they can only 
manipulate them as F elements. In general, the most we can say is that, if an 
input has helpfully been encoded using the elements 0,1 ¢€ F only, then an 
arithmetic circuit can simulate a Boolean one, by using x — 1 — x to simulate 
NOT, multiplication to simulate Boolean AND, and so on. But for arbitrary inputs, 
such a simulation might be impossible. 

Thus, arithmetic circuits represent a different kind of computation: or rather, a 
generalization of the usual kind, since we can recover ordinary Boolean computation 
by setting F = F,. A major reason to focus on arithmetic circuits is that it 
often seems easier—or better, less absurdly hard!—to understand circuit size in the 
arithmetic setting than in the Boolean one. The usual explanation given for this is 
the so-called “yellow books argument”: arithmetic complexity brings us closer to 
continuous mathematics, about which we have centuries’ worth of deep knowledge 
(e.g., algebraic geometry and representation theory) that’s harder to apply in the 
Boolean case. 

One remark: in the rest of the section, I'll talk exclusively about arithmetic 
circuit complexity: that is, about nonuniform arithmetic computations, and the 
arithmetic analogues of questions such as NP versus P/poly (see Sect. 5.2). But 
it’s also possible to develop a theory of arithmetic Turing machines, which (roughly 
speaking) are like arithmetic circuits except that they’re uniform, and therefore need 
loops, conditionals, memory registers, and so on. See the book of Blum, Cucker, 
Shub, and Smale (BCSS) [46] for a beautiful exposition of this theory. In the BCSS 


: 2 : : 
framework, one can ask precise analogues of the P = NP question for Turing 
machines over arbitrary fields F, such as R or C, recovering the “ordinary, Boolean” 


ty 
P = NP question exactly when F is finite. At present, no implications are known 
2 


among the P as NP, the Pp = NPa, and the Pc as NP questions, although it’s 
known that Pc # NPc implies NP ¢ P/poly (see for example Biirgisser [57, 
Chap. 8]).* 

The problems of proving Pg #4 NPp and Pc # NP¢ are known to be closely 
related to the problem of proving arithmetic circuit lower bounds, which we’ll 
discuss in the following sections. I can’t resist giving one example of a connection, 
due to BCSS [46]. Given a positive integer n, let t (n) be the number of operations 
in the smallest arithmetic circuit that takes the constant | as its sole input, and 
that computes n using additions, subtractions, and multiplications. For example, we 
have 


2 9 
The central difference between the Pp = NPp and Pc = NPc questions is simply that, because 
R is an ordered field, one defines Turing machines over R to allow comparisons (<, <) and 
branching on their results. 


78 S. Aaronson 


« t(2)=I1vial+1, 
« 7(3)=2vial+1+1, 
* (4) =2via(14+1)’,... 
Also, let t* (n) be the minimum of t (kn) over all positive integers k. 


Theorem 65 (BCSS [46]). Suppose t* (n!) grows faster than (log n)?, Then 
Po # NPo.? 


6.5.1 Permanent Versus Determinant 
The central problem studied in arithmetic complexity theory—if you like, the 
arithmetic analogue of the NP vs. P/poly problem—is the permanent versus 


determinant problem. The problem concerns the following two functions of ann xn 
matrix X € F"": 


Per(X) = )°] | xiow, 


o€S, i=1 
n 
Det (X) = > (—1)28) | [xico. 
oESy i=1 


Despite the similarity of their definitions—they are identical apart from the 
(—1)°8")—the permanent and determinant have some dramatic differences. The 
determinant is computable in polynomial time, for example by using Gaussian 
elimination. (Indeed, the determinant is computable in O(n”) time, where 
w € [2, 2.373] is the matrix multiplication exponent; see Sect. 4.) The determinant 
has many other interpretations—for example, the product of X’s eigenvalues, and 
the volume of the parallelepiped spanned by its row vectors—giving it a central role 
in linear algebra and geometry. 

By contrast, Valiant [226] proved in 1979 that the permanent is #P-complete. 
Thus, a polynomial-time algorithm for the permanent would imply even more than 
P = NP: it would yield an efficient algorithm, not merely to solve NP-complete 
problems, but even to count how many solutions they have. In some sense, the 
#P-completeness of the permanent explains why Per, unlike Det, has no simple 
geometric or linear-algebraic interpretations: if such interpretations existed, then 
presumably they would imply P = P#?. 

In the arithmetic model, there exist arithmetic circuits of size O (n’), and even 
size O(n®), that compute Det (X) as a formal polynomial in the entries of X, and 
that work over an arbitrary field F. By contrast, Valiant conjectured the following. 


5°Tn later work, Biirgisser [59] showed that the same conjecture about t* (n!) (called the r- 
conjecture) would also imply Valiant’s Conjecture 66, that the permanent has no polynomial-size 
arithmetic circuits. 
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Conjecture 66 (Valiant’s Conjecture). Any arithmetic circuit for Per (X) requires 
size superpolynomial in n, over any field of characteristic other than 2.>'*? 


Biirgisser [58] showed that, if Conjecture 66 fails over any field of positive 
characteristic, or if it fails over any field of characteristic zero and the Generalized 
Riemann Hypothesis holds, then P*? C P/poly, and hence NP C P/poly.°* (The 
main difficulty in proving this result is just that an arithmetic circuit might have very 
large constants hardwired into it.) On the other hand, no converses to this result 
is currently known. It’s conceivable, for example, that we could have P = P#P 
for some “inherently Boolean” reason, even if the permanent required arithmetic 
circuits of exponential size. To put it another way, proving Conjecture 66 could serve 
as an “arithmetic warmup’”—some would even say an “arithmetic prerequisite”—to 
proving Boolean separations such as P*P ¢ P/poly and P 4 NP. 

Better yet, Conjecture 66 turns out to be implied by (and nearly equivalent to) an 
appealing mathematical conjecture, which makes no direct reference to computation 
or circuits. Let’s say that the m x n permanent linearly embeds into the m x m 
determinant, if it is possible to express Per (X) (for an n x n matrix X € F”*") 
as Det (L (X)), where L (X) is an m x m matrix each of whose entries is an affine 
combination of the entries of X. Then let D (n) be the smallest m such that the n x n 
permanent linearly embeds into the m x m determinant. 

Grenet [92] showed that D (n) < 2”. By contrast, the best current lower bound on 
D (n) is quadratic, and was proved by Mignon and Ressayre [153] in 2004, following 
a long sequence of linear lower bounds: 


Theorem 67 (Mignon and Ressayre [153]). D (n) > n /2. 
(Actually, Mignon and Ressayre proved Theorem 67 only for fields of character- 


istic 0. Their result was then extended to all fields of characteristic other than 2 by 
Cai et al. [64] in 2008.) 

The basic idea of the proof of Theorem 67 is to consider the Hessian matrix of 
a polynomial p : FY — F, or the matrix of second partial derivatives, evaluated at 
some particular point Xp € FY: 


Ox] ox. ‘N 


HD yy Se) 
Oxy 
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2p ep 
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51Fields of characteristic 2, such as F , are a special case: there, the permanent and determinant 
are equivalent, so in particular Per (X) has polynomial-size arithmetic circuits. 


>In the literature, Conjecture 66 is often called the VP # VNP conjecture, with VP and VNP 
being arithmetic analogues of P and NP respectively. I won’t use that terminology in this survey, 
for several reasons: (1) VP is arguably more analogous to NC than to P, (2) VNP is arguably 
more analogous to #P than to NP, and (3) Conjecture 66 is almost always studied as a nonuniform 
conjecture, more analogous to NP ¢ P/poly than to P # NP. 


53Indeed, #P would even have polynomial-size circuits of depth log? n. 
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Here we mean the “formal” partial derivatives of p: even if F is a finite field, 
one can still symbolically differentiate a polynomial over F, to produce new 
polynomials over smaller sets of variables. In general, when we’re trying to lower- 
bound the difficulty of computing a polynomial p, a common technique in arithmetic 


complexity is to look at various partial derivatives — 
the dimensions of vector spaces spanned by those partial derivatives, or the ranks 
of matrices formed from them—and then argue that, if p had a small circuit (or 
formula, or whatever), then those dimensions or ranks couldn’t possibly be as high 
as they are. 


In the case of Theorem 67, one proves the following two statements: 


—and in particular, at 


(1) If p is the permanent, of an n x n matrix of N = n? indeterminates, then there 
exists a point Xj € F% such that rank (H, (Xo)) = N. 

(2) If p is the determinant of an m x m matrix of affine functions in the NV 
indeterminates, then rank (H, (X)) < 2m for every X. 


Combining these, one gets m > n?/2, if p is both the n x n permanent and an 
m X m determinant. 

So to summarize, the “blowup” D(n) in embedding the permanent into the 
determinant is known to be at least quadratic and at most exponential. The huge gap 
here becomes a bit less surprising, once one knows that D (n) is tightly connected 
to the arithmetic circuit complexity of the permanent. In particular, recall that a 
formula is just a circuit in which every gate has a fanout of 1. Then Valiant [225] 
showed the following: 


Theorem 68 (Valiant [225]). D(n) < F(n) + 1, where F (n) is the size of the 
smallest arithmetic formula for the n x n permanent. 


Thus, if we could prove that D(n) grew faster than any polynomial, we’d have 
shown that the permanent has no polynomial-size formulas. But heightening the 
interest still further, Valiant et al. [228] showed that in the arithmetic world, there’s 
a surprisingly tight connection between formulas and circuits: 


Theorem 69 (Valiant et al. [228]). If a degree-d polynomial has an arithmetic 
circuit of size s, then it also has an arithmetic formula of size (sd) O08, 


Theorem 69 implies that C(n) < D(n)?"°8”, where C(n) is the size of the 
smallest arithmetic circuit for the n x n permanent. This means that, if we could 
prove that D(n) grew not only superpolynomially but faster than n?"°2”), we'd 
also have shown that C (n) grew superpolynomially, thereby establishing Valiant’s 
Conjecture 66. 

But lower-bounding D (v) is not merely sufficient for proving Valiant’s Conjec- 
ture; it’s also necessary! For recall that the nxn determinant has an arithmetic circuit 
of size O (n3 i and even O (n®). So we get the following chain of implications: 


D(n) > 28” —> F (n) > n°" (by Theorem 68) 
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=> C(n) > n° (by Theorem 69; this is Valiant’s Conjecture 66) 
=> D(n) > n° (by then?” arithmetic circuit for determinant) 


=> F (n) > n° (by Theorem 68). 


Today, a large fraction of the research aimed at proving P 4 NP is aimed, more 
immediately, at proving Valiant’s Conjecture 66 (see Agrawal [13] for a survey 
focusing on that goal). The hope is that, on the one hand, powerful tools from 
algebraic geometry and other fields can be brought to bear on Valiant’s problem, 


but on the other, that solving it could provide insight about the original P = NP 
problem. 


6.5.2 Arithmetic Circuit Lower Bounds 


I won’t do justice in this survey to the now-impressive body of work motivated 
by Conjecture 66; in particular, I'll say little about proof techniques. Readers who 
want to learn more about arithmetic circuit lower bounds should consult Shpilka 
and Yehudayoff [205, Chap. 3] for an excellent survey circa 2010, or Saraf [196] 
for a 2014 update. Briefly, though, computer scientists have tried to approach 
Conjecture 66 much as they’ve approached NP ¢ P/poly, by proving lower bounds 
against more and more powerful arithmetic circuit classes. In that quest, they’ve 
had some notable successes (paralleling the Boolean successes), but have also run 
up against some major differences from the Boolean case. 

For starters, just as Razborov [187] and others considered monotone Boolean 
circuits, one can also consider monotone arithmetic circuits (over fields such as R 
or Q), in which all coefficients need to be positive. Since the determinant involves 
—1 coefficients, it doesn’t make sense to ask about monotone circuits for Det (X), 
but one can certainly ask about the monotone circuit complexity of Per (X). And 
already in 1982, Jerrum and Snir [116] proved the following arithmetic counterpart 
of Razborov’s Theorem 44: 


Theorem 70 (Jerrum and Snir [116]). Any monotone circuit for Per (X) requires 
size 22), 


As another example, just as computer scientists considered constant-depth 
Boolean circuits (the classes AC’, ACC, TC°, and so on), so we can also consider 
constant-depth arithmetic circuits, which are conventionally denoted XII, HIT, 
etc. to indicate whether they represent a multivariate polynomial as a sum of 
products, a sum of product of sums, etc. It’s trivial to prove exponential lower 
bounds on the sizes of depth-two (IT) circuits: that just amounts to lower-bounding 
the number of monomials in a polynomial. More interesting is the following result: 
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Theorem 71 (Grigoriev and Karpinski [93], Grigoriev and Razborov [94]). 
Over a finite field, any STILE circuit for Det (X) requires size 2°. (Indeed, this 
is true even for circuits representing Det (X) as a function.) 


Curiously, over infinite fields, the best lower bound that we have is still a much 
weaker one, due to Shpilka and Wigderson [204]: 


Theorem 72 (Shpilka and Wigderson [204]). Over infinite fields, any XTTX 
circuit for Det (X) requires size Q (n*/ log n). 


Theorems 71 and 72 are stated for the determinant, although they have analogues 
for the permanent. In any case, these results certainly don’t succeed in showing that 
the permanent is harder than the determinant. 

The situation is better when we restrict the fanin of the multiplication gates. In 
particular, by a ST! D1! circuit, let’s mean a depth-4 circuit where every inner 
multiplication gate has fanin at most a, and every bottom multiplication gate has 
fanin at most b. Then in 2013, Gupta et al. [99] proved the following. 


Theorem 73 (Gupta et al. [99]). Any SL] snlv"l circuit for Per (X) or 
Det (X) requires size 22(vi), 


Subsequently, Kayal et al. [126] proved a size lower bound of n@(V") for such 
circuits, though not for the permanent or determinant but for a different explicit 
polynomial. 

The situation is also better when we restrict to homogeneous arithmetic circuits. 
These are circuits where every gate is required to compute a homogeneous 
polynomial: that is, one where all the monomials have the same degree. Here Nisan 
and Wigderson [173] established the following in 1997. 


Theorem 74 (Nisan and Wigderson [173]). Over any field, any homogeneous 
DIY circuit for Det (X) requires size 22”, 


Going further, in 2014 Kayal et al. [125] gave an explicit polynomial for which 
any homogeneous UTI XT circuit requires size n&(Vn) 

It’s natural to wonder: why are we stuck talking about depth-3 and depth-4 
arithmetic circuits? Why couldn’t we show that the permanent and determinant have 
no constant-depth arithmetic circuits of subexponential size, just like Theorem 47 
and its successors showed that PARITY has no constant-depth Boolean circuits of 
subexponential size? After all, wasn’t the whole point of arithmetic complexity that 
it was supposed to be easier than Boolean complexity? 

In 2008, Agrawal and Vinay [15] gave a striking answer to these questions; 
they called their answer “the chasm at depth four.” In particular, building on the 
earlier work of Valiant et al. [228] (Theorem 69), Agrawal and Vinay showed that, 
if we managed to prove strong enough lower bounds for depth-4 arithmetic circuits, 
then we’d also get superpolynomial lower bounds for arbitrary arithmetic circuits! 
Here’s one special case of their result: 
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Theorem 75 (Agrawal and Vinay [15]). Suppose that Per (X) requires depth-4 
arithmetic circuits (even homogeneous ones) of size 22. Then Per (X) requires 
arithmetic circuits of superpolynomial size, and Valiant’s Conjecture 66 holds. 


Subsequently, Koiran [133] and Tavenas [221] showed that Valiant’s Conjecture 
would follow, not merely from a 2°” size lower bound for homogeneous depth- 
4 circuits computing the permanent, but from any size lower bound better than 
n2(v7), In an even more exciting development, Gupta et al. [98] reduced the depth 
from four to three (though only for fields of characteristic 0, and no longer allowing 
homogeneity): 


Theorem 76 (Gupta et al. [98]). Suppose that Per (X) requires depth-3 arithmetic 
circuits of size more than n@(vn), over fields of characteristic 0. Then Per (X) 


requires arithmetic circuits of superpolynomial size, and Valiant’s Conjecture 66 
holds. 


These results can be considered extreme versions of the depth reduction of Brent 
[52] (see Proposition 24). I should mention that all of these results hold, not just for 
the permanent, but for any homogeneous polynomial of degree n°. In particular, 
by applying their depth reduction “in the opposite direction” for the determinant, 
Gupta et al. [98] were able to show that there exist depth-3 arithmetic circuits of 
size n°(V") for Det (X). This provides an interesting counterpoint to the result of 
Nisan and Wigderson [173] (Theorem 74), which showed that size 22) is needed 
for the determinant if we restrict to depth-3 homogeneous circuits. 

There are yet other results in this vein, which give yet other tradeoffs. But perhaps 
we should step back from the flurry of theorems and try to summarize the situation. 
After decades of research in arithmetic circuit complexity, we now have lower 
bounds of the form n@(v”) on the sizes of depth-3 and depth-4 arithmetic circuits 
computing explicit polynomials (subject to various technical restrictions). On the 
other hand, we also have a deep explanation for why the progress has stopped at the 
specific bound n® (V"): because any lower bound even slightly better than that would 
already prove Valiant’s Conjecture, that the permanent is superpolynomially harder 
than the determinant! It’s as if, in arithmetic complexity, we reach a terrifying 
precipice—beyond which we can no longer walk but need to fly—sooner than we 
do in the Boolean case. And around 2014, we learned exactly where that precipice 
is and walked right up to it, but we still haven’t jumped. 

In this connection, it’s worth pointing out that, with the exception of Theorem 67 
by Mignon and Ressayre [153], none of the results in this section actually 
differentiate the permanent from the determinant: that is, none of them prove a 
lower bound for Per (X) better than the analogous lower bound known for Det (X). 
Eventually, of course, any proof of Valiant’s Conjecture will need to explain why 
the permanent is harder than the determinant, which is one of the main motivations 
for the Mulmuley-Sohoni program (see Sect. 6.6). 

Let me end this section by discussing two striking results of Ran Raz that didn’t 
quite fit into the narrative above. The first result is a superpolynomial lower bound 
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on the sizes of multilinear formulas. An arithmetic formula is called multilinear 
if the polynomial computed by each gate is a multilinear polynomial (that is, 
no variable is raised to a higher power than 1). Notice that the permanent and 
determinant are both multilinear polynomials. For that reason, they can be computed 
by multilinear formulas, and it makes sense to ask about the size of the smallest such 
formulas. 

In a 2004 breakthrough, Raz [182] proved the following. 


Theorem 77 (Raz [182]). Any multilinear formula for Per (X) or Det (X) requires 
size n2{logn) 54 


What made Theorem 77 a breakthrough was that there was no restriction on the 
formula’s depth. The proof was via the random restriction method from Sect. 6.2.3, 
combined with the idea (common in arithmetic complexity) of using matrix rank as 
a progress measure. In more detail, let p : {0, 1}” — R be a polynomial computed 
by a small multilinear formula: for simplicity, we’ll take p’s inputs to be Boolean. 
Then basically, one randomly partitions p’s input variables into two small sets X = 
{x1,...,x¢} and Y = {y,,..., y,}, and a large set Z of size n — 2k. (Here one should 
imagine, say, k = n'/?.) One then randomly fixes the variables in Z to 0’s or 1’s, 
while leaving the variables in X and Y unfixed. Next, one defines a matrix M ¢€ 
R2x2 whose rows are indexed by the 2‘ possible assignments to X, whose columns 
are indexed by the 2* possible assignments to Y, and whose (X, Y) entry equals 
p(X, Y, Z). Finally, one proves the following two statements: 


¢ With high probability, M has rank much smaller than 2“. This is the hard part of 
the proof: one uses the assumption that p has a small multilinear formula, and 
then argues by induction on the formula. 

¢ Ifp represents the function f of interest to us (say, the permanent or determinant), 
then rank (M) = 2* with certainty. 


Together, these yield the desired contradiction, showing that f can’t have had a 
small multilinear formula after all. 

It seems likely that the lower bound in Theorem 77 could be improved from 
n@0s”) all the way up to 2°), but this remains open. Raz and Yehudayoff [185] 
did manage to prove an exponential lower bound for constant-depth multilinear 
formulas computing the permanent or determinant; and in a separate work [186], 
they also proved a 22 lower bound for “non-cancelling” multilinear formulas 
computing an explicit polynomial f (not the permanent or determinant). Here “non- 
cancelling”—a notion that I defined in [2]—basically means that nowhere in the 
formula are we allowed to add two polynomials that “almost perfectly” cancel each 
other out, leaving only a tiny residue. 


54An immediate corollary is that any multilinear circuit for Per(X) or Det (X) requires depth 
Q (log? n). 
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Of course, just like with the arithmetic circuit lower bounds discussed earlier, so 
far all the known multilinear formula lower bounds fail to distinguish the permanent 
from the determinant. 

The second result of Raz’s concerns so-called elusive functions. Given a polyno- 
mial curve f : C > C”, Raz calls f elusive if f is not contained in the image of 
any polynomial mapping g : C”"! — C" of degree 2. He then proves the following 
striking theorem. 


Theorem 78 (Raz [183]). Suppose there exists an elusive function whose coeffi- 
cients can be computed in polynomial time. Then Per (X) requires arithmetic circuits 
of superpolynomial size, and Valiant’s Conjecture 66 holds. 


Arguably, this makes Valiant’s Conjecture look even more like a question of 
pure algebraic geometry than it did before! As evidence that the “elusive function” 
approach to circuit lower bounds is viable, Raz then constructed an explicit f that 
was elusive in a weak sense, which was already enough to imply the following new 
lower bound: 


Theorem 79 (Raz [183]). For every r, there is an explicit polynomial p with n 
variables and degree O(r), such that any depth-r arithmetic circuit for p (over any 
field) requires size n't 20/9, 


6.5.3 Arithmetic Natural Proofs? 


In Sect. 6.5.2, we saw arithmetic circuit lower bounds that, again and again, seem to 
go “right up to the brink” of proving Valiant’s Conjecture, but then stop short. Given 
this, it’s natural to wonder what the barriers are to further progress in arithmetic 
complexity, and how they relate to the barriers in the Boolean case. 

We’ve already discussed one obvious barrier, which is that eventually we need 
techniques that work for the permanent but fail for the determinant. It might also be 
interesting to define an arithmetic analogue of the relativization barrier (Sect. 6.1.2). 
To my knowledge, this hasn’t been done, but my guess is that in the arithmetic 
setting, the natural choices for oracles would look a lot like the algebraic oracles 
studied by Aaronson and Wigderson [10] (see Sect. 6.3.3). With a notion of “oracle” 
in hand, one could probably show that most arithmetic circuit lower bounds require 
arithmetically non-relativizing techniques. On the other hand, this wouldn’t be much 
of an obstruction, since even the results discussed in Sect. 6.5.2 should already evade 
the relativization barrier, for the same reason as those of Sects. 6.2.3 and 6.2.4. 

In the rest of this section, I’d like to discuss the contentious question of whether 
or not arithmetic circuit complexity faces a natural proofs barrier, in the sense of 
Razborov and Rudich [191]. Recall from Sect. 6.2.5 that a circuit lower bound proof 
is called natural if, besides proving that the specific function f of interest to us is not 
in a circuit class C, the proof also provides a polynomial-time algorithm A that takes 
as input a function’s truth table, and that certifies a 1/n°“) fraction of all functions 
as not belonging to C. Such an A can be used to distinguish functions in C from 
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random functions with non-negligible bias. Meanwhile, the class C has a natural 
proofs barrier if C contains pseudorandom function families, which cannot be so 
distinguished from random functions, and whose existence is therefore incompatible 
with the existence of A. 

In the arithmetic setting, presumably we’d call a proof natural if it yields a 
polynomial-time algorithm* that takes as input, say, the complete output table 
of a homogeneous degree-d polynomial p : F” — F over a finite field F, and 
that certifies a 1/n°“ fraction of all such polynomials as not belonging to the 
arithmetic circuit class C. Also, we’d say that C has a natural proofs barrier 
if C contains pseudorandom polynomial families. By this, we mean families of 
homogeneous degree-d polynomials, p, : F" — F, that no |F|C-time algorithm 
can distinguish from uniformly-random homogeneous degree-d polynomials with 
non-negligible bias. (We can no longer talk about uniformly-random functions, since 
an algorithm can easily ascertain, for example, that p, is a degree-d polynomial.) By 
exactly the same logic as in the Boolean case, if C is powerful enough to compute 
pseudorandom polynomials, then no natural proof can show that a polynomial is not 
inc. 

Now, one point that’s not disputed is that all the arithmetic circuit lower bounds 
discussed in Sect. 6.5.2 are natural in the above sense. I didn’t say much about how 
the lower bounds are proved, but as mentioned in Sect. 6.5.1, arithmetic circuit 
lower bounds generally proceed by finding some parameter a (p) associated with 
a polynomial p—say, the rank of its Hessian matrix, or the dimension of a vector 
space spanned by p’s partial derivatives—to use as a “progress measure.” The proof 
then argues that 


(1) @ (p) is large for the specific polynomial p of interest to us (say, the permanent 
or determinant), but 
(2) every gate added to our circuit or formula can only increase a (p) by so much, 


thereby implying that p requires many gates. Furthermore, virtually any progress 
measure q@ that’s a plausible choice for such an argument—and certainly the 
ones used in the existing result—will be computable in \F|e” time, and will be 
maximized by a random polynomial p of the appropriate degree. Alas, this implies 
that the argument is natural! If the circuit class C has a natural proofs barrier, then 
no such argument can possibly prove p ¢ C. 

The only part that’s controversial is whether arithmetic circuit classes do have 
a natural proofs barrier. To show that they did, we’d need plausible candidates for 
pseudorandom polynomials—e.g., homogeneous degree-d polynomials p : F” > F 
that actually have small arithmetic circuits, but that look to any efficient test just 
like random homogeneous polynomials of degree d. The trouble is that, while 
cryptographers know a great deal about how to construct pseudorandom functions, 


For simplicity, here I'll assume that we mean an “ordinary” (Boolean) polynomial-time 
algorithm, though one could also require polynomial-time algorithms in the arithmetic model. 
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the accepted constructions are all “inherently Boolean”; they don’t work in the 
setting of low-degree polynomials over a finite field. 

Thus, to take one example, the work of Goldreich, Goldwasser, and Micali 
(GGM) [88], combined with that of Hastad et al. [106], shows how to build a 
pseudorandom function family starting from any one-way function (see Sect. 5.3.1). 
And indeed, Razborov and Rudich [191] used a variant of the GGM construction 
in their original paper on natural proofs. However, if we try to implement the GGM 
construction using arithmetic circuits—say, using multiplication for the AND gates, 
1 — x for the NOT gates, etc.—we’ll find that we’ve produced an arithmetic circuit 
of n°“) depth, which computes a polynomial of exp (n°) degree: far too large. 

As I mentioned in Sect. 6.2.5, if we’re willing to assume the hardness of specific 
cryptographic problems, then there are also much more direct constructions of 
pseudorandom functions, which produce circuits of much lower depth. In particular, 
there’s the construction of Naor and Reingold [170], which is based on factoring 
and discrete logarithm; and that of Banerjee et al. [35], which is based on noisy 
systems of linear equations. Unfortunately, examination of these constructions 
reveals that they, too, require treating the input as a string of bits rather than of finite 
field elements. So for example, the Naor-Reingold construction involves modular 
exponentiation, which of course goes outside the arithmetic circuit model, where 
only addition and multiplication are allowed. 

At this point I can’t resist stating my own opinion, which is that the issue here is 
partly technical but also partly social. Simply put: Naor-Reingold and Banerjee et al. 
are taken to be relevant to natural proofs, because factoring, discrete logarithm, and 
solving noisy systems of linear equations have become accepted by the community 
of cryptographers as plausibly hard problems. Since real computers use Boolean 
circuits, and since in practice one normally needs pseudorandom functions rather 
than polynomials, cryptographers have had extremely little reason to study pseu- 
dorandom low-degree polynomials that are computed by small arithmetic circuits 
over finite fields. If they had studied that, though, it seems entirely plausible that 
they would’ve found decent candidates for such polynomials, and formed a social 
consensus that they indeed seem hard to distinguish from random polynomials. 

Motivated by that thought, in a 2008 blog post [5], I offered my own candidate 
for a pseudorandom family of polynomials, p, : F” — F, which are homogeneous 
of degree d = n°), My candidate was simply this: motivated by Valiant’s result 
[225] that the determinant can express any arithmetic formula (Theorem 68), take 
the random seed s to encode d? uniformly-random linear functions, Lj; : F” > F 
for all i,j € {1,...,d}. Then set 


Li (X1,---,Xn) eee Lia (x1,---,Xn) 


Ds (X1,--+-,%n) = Det 


La (X1,.--,%n) .— Lad (X1,--+5Xn) 


My conjecture is that, at least when d is sufficiently large, a random p, drawn 
from this family should require exp (a2) time to distinguish from a random 
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homogeneous polynomial of degree d, if we’re given the polynomial p : F” > F 
by a table of |F|” values. If d is a large enough polynomial in n, then exp (d%) is 


greater than \F|O , so the natural proofs barrier would apply. 
So far there’s been little study of this conjecture, with the exception of a 2012 
paper by Kayal [124], which proved the following. 


Theorem 80 (Kayal [124]). Let d < /n, and suppose we’re given black-box 
access to a degree-d polynomial p : F” — F, which is promised to be the permanent 
or determinant of a d x d matrix of linear forms. Then there exists a randomized, 
n°) -time algorithm to reconstruct the linear forms. 


If we don’t need to reconstruct the linear forms, but only break my pseudorandom 
polynomial candidate, then the results of Mignon and Ressayre [153] (Theorem 67) 
let us push the range where the algorithm works all the way up to d < n/ V2. It’s 
not known what happens for larger d. 

Of course, if our goal is to prove P # NP or NP ¢ P/poly, then perhaps the 
whole question of arithmetic natural proofs is ultimately beside the point. For to 
prove NP ¢ P/poly, we’ll need to become experts at overcoming the natural proofs 
barrier in any case: either in the arithmetic world, or if not, then when we move 
from the arithmetic world back to the Boolean one. 


6.6 Geometric Complexity Theory 


Pll end this survey with some extremely high-level remarks about Geometric 
Complexity Theory (GCT): a breathtakingly ambitious program to prove P 4 NP 
and related conjectures using algebraic geometry and representation theory. This 
program has been pursued since the late 1990s primarily by Ketan Mulmuley, 
though with important contributions by Milind Sohoni and others. 

I like to describe GCT as “the string theory of computer science.” Like string 
theory, GCT has the aura of an intricate theoretical superstructure from the far 
future, impatiently being worked on today despite our inability to test some key 
premises. Both theories have attracted interest partly because of “miraculous coin- 
cidences” (for string theory, these include anomaly cancellations and the prediction 
of gravitons; for GCT, exceptional properties of the permanent and determinant, 
and remarkable algorithms to compute the multiplicities of irreps). Both have been 
described as beautiful, deep, compelling, and even “the only game in town” (not 
surprisingly, a claim disputed by the fans of rival ideas!). And like with string theory, 
there’s scarcely any part of modern mathematics that’s not known or believed to be 
relevant to GCT.°° 

For both string theory and GCT, however, the central problem has been to say 
something novel and verifiable about the real-world phenomena that motivated the 


> Although so far, I haven’t seen a conjectured role for topology or logic in GCT. 
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theory in the first place, and to do so in a way that depends essentially on the theory’s 
core tenets (rather than on inspirations or analogies, or on incidental fragments of 
the theory). For string theory, that would mean making confirmed predictions or 
(e.g.) explaining the masses and generations of the elementary particles; for GCT, it 
would mean proving new circuit lower bounds. Indeed, proponents of both theories 
freely admit that one might spend one’s entire career on the theory, without living 
to see a payoff of that kind.*” 

GCT is not an easy subject, and I don’t pretend to be an expert. Part of the 
difficulty is inherent, while part of it is that the primary literature on GCT is not 
optimized for beginners: it contains a profusion of new ideas, extended analogies, 
speculations, theorems, speculations given acronyms and then referred to as if they 
were theorems, and advanced mathematics assumed as background, all in papers 
and manuscripts that cite each other in spaghetti fashion, making it difficult to track 
down where a given claim is proved. 

Mulmuley regards the beginning of GCT as a 1997 paper of his [158], which 
used algebraic geometry to prove a circuit lower bound for the classic MAXFLOW 
problem. In MAXFLOW, we’re given as input a description of an n-vertex directed 
graph G, with nonnegative integer weights on the edges (called capacities), along 
with designated source and sink vertices s and t, and a nonnegative integer w (the 
target). The problem is to decide whether w units of a liquid can be routed from 
s to ¢t, with the amount of liquid flowing along an edge e never exceeding e’s 
capacity. The MAXFLOw problem is famously in P,>* but the known algorithms 
are inherently serial, and it remains open whether they can be parallelized. More 
concretely, is MAXFLOW in NC’, or some other small-depth circuit class? Note 
that any proof of MAXFLOW ¢ NC! would imply the spectacular separation 
Nc! # P. Nevertheless, Mulmuley [158] managed to prove a strong lower bound 
for a restricted class of circuits, which captures almost all the MAXFLOW algorithms 
known in practice. 


Theorem 81 (Mulmuley [158]). Consider an arithmetic circuit for MAXFLOW, 
which takes as input the integer capacities and the integer target w.°? The gates, 
which all have fanin 2, can perform integer addition, subtraction, and multiplication 
(+,—, X) as well as integer comparisons (=, <,<). A comparison gate returns the 
integer | if it evaluates to “true” and 0 if it evaluates to “false.” The circuit’s final 


>7Furthermore, just as string theory didn’t predict what new data there has been in fundamental 
physics in recent decades (e.g., the dark energy), so GCT played no role in, e.g., the proof of 
NEXP ¢ ACC (Sect. 6.4.2), or the breakthrough lower bounds for small-depth circuits computing 
the permanent (Sect. 6.5.2). In both cases, to point this out is to hold the theory to a high and 
probably unfair standard, but also, ipso facto, to pay the theory a compliment. 

>8MAXFLOwW can easily be reduced to linear programming, but Ford and Fulkerson [78] also gave 
a much faster and direct way to solve it, which can be found in any undergraduate algorithms 
textbook. There have been further improvements since. 

5°We can assume, if we like, that G, s, and t are hardwired into the circuit. We can also allow 
constants such as 0 and | as inputs, but this is not necessary, as we can also generate constants 
ourselves using comparison gates and arithmetic. 
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output must be | if the answer is “yes” and 0 if the answer is “no.” Direct access to 
the bit-representations of the integers is not allowed, nor (for example) are the floor 
and ceiling functions. 

Any such circuit must have depth Q (Jn). Moreover, this holds even if the 
capacities are restricted to be O (n°) -bit integers. 


Similar to previous arithmetic complexity lower bounds, the proof of Theorem 81 
proceeds by noticing that any small-depth arithmetic circuit separate the yes- 
instances from the no-instances via a small number of low-degree algebraic surfaces. 
It then appeals to results from algebraic geometry (e.g., the Milnor-Thom Theorem) 
to show that, for problems of interest (such as MAXFLOW), no such separation 
by low-degree surfaces is possible. As Mulmuley already observed in [158], the 
second part of the argument would work just as well for a random problem. So 
we get a natural proof in the Razborov-Rudich sense (Sect. 6.2.5), meaning that the 
same technique can’t possibly work to prove NC’ # P: it must break down when 
arbitrary bit operations are allowed. Nevertheless, for Mulmuley, Theorem 81 was 
strong evidence that algebraic geometry provides a route to separating complexity 
classes. 

The central papers on GCT, written between 2001 and 2013, are as follows: 


¢ GCT1 [167] by Mulmuley and Sohoni, which already contains almost all of the 
main ideas, and should be read (perhaps along with GCT6) before attempting the 
others. 

¢ GCT2 [168], by Mulmuley and Sohoni, which has further technical results about 
representation-theoretic obstructions, and which conjectures that the orbit clo- 
sures relevant to GCT are essentially captured by their representation-theoretic 
data. 

¢ GCT3 [166], by Mulmuley, Narayanan, and Sohoni, which gives an efficient 
algorithm, based on linear programming, to decide the positivity of Litthewood- 
Richardson coefficients. Biirgisser and Ikenmeyer [60] later gave a combinatorial 
algorithm for the same problem. 

¢ GCT4 [45], by Blasiak, Mulmuley, and Sohoni, which gives a “positive formula” 
for certain Kronecker coefficients (putting the computation of those coefficients 
in #P). 

¢ GCTS [165] by Mulmuley, which gives a conditional derandomization of 
Noether’s Normalization Lemma, a subject of perhaps tangential relevance to the 
rest of the GCT program. Forbes and Shpilka [77] later gave an unconditional 
version of the same result. 

¢ GCT6 [162] by Mulmuley, which steps back and revisits the basics of GCT in 
a somewhat more philosophical way, explaining what Mulmuley calls “the Flip” 
(and what this survey called “ironic complexity theory”), and arguing the need 
for explicit obstructions. 

¢ GCT7 [159] and GCT8 [160] by Mulmuley, which give conjectures about 
quantum groups that, if true, would yield positive formulas for certain plethysm 
problems (again, putting those problems in #P). 
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Readers seeking a less overwhelming introduction can try Mulmuley’s overviews 
in Communications of the ACM [164] or in Journal of the ACM [163]; or lecture 
notes and other materials on Mulmuley’s GCT website [157]; or surveys by Regan 
[192] or Landsberg [137] (the latter is for geometers, not computer scientists). In my 
view, however, perhaps the most beginner-friendly exposition of GCT yet written is 
contained in Joshua Grochow’s Ph.D. thesis [95, Chap. 3]. Indeed, some readers 
might want to put my survey down at this point and read Grochow’s thesis instead. 


6.6.1 From Complexity to Algebraic Geometry 


So, what is GCT? It’s easiest to understand GCT as a program to prove Valiant’s 
Conjecture 66—that is, to show that any affine embedding of the n x n permanent 
over C into the m x m determinant over C requires (say) m = an and hence, 
that the permanent requires exponential-size arithmetic circuits. GCT also includes 
an even more speculative program to prove Boolean lower bounds, such as NP ¢ 
P/poly and hence P # NP. However, if one accepts the premises of GCT in the 
first place (e.g., the primacy of algebraic geometry for circuit lower bounds), then 
one might as well start with the permanent versus determinant problem, since that’s 
where the core ideas of GCT come through the most clearly. 

The first observation Mulmuley and Sohoni make is that Valiant’s Conjecture 66 
can be translated into an algebraic-geometry conjecture about orbit closures. In 
more detail, consider a group G that acts on a set V: in the examples that interest 
us, G will be a group of complex matrices, while V will be a high-dimensional 
vector space over C (e.g., the space of all homogeneous degree-n polynomials over 
some set of variables). Then the orbit of a point v € V, denoted Gu, is the set 
{g-uv:g © G}. Also, the orbit closure of x, denoted Gv, is the closure of Gv in 
the usual complex topology. It contains all the points that can be arbitrarily well 
approximated by points in Gu. 

To be more concrete, let G = GL,,2 (C) be the general linear group: the group 
of all invertible m* x m* complex matrices. Also, let V be the vector space of all 
homogeneous degree-m complex polynomials over m? variables; the variables are 
labeled x1.1,...,Xmm, and we’ll think of them as the entries of an m x m matrix 
X. (Recall that a homogeneous polynomial is one where all the monomials have the 
same degree m; restricting to them lets GCT talk about linear maps rather than affine 
ones.) Then G acts on V in an obvious way: for all matrices A € G and polynomials 
p € V, we can set (A - p) (x) := p (Ax). Indeed, this action is a group representation: 
each A € G acts linearly on the coefficients of p, so we get a homomorphism from 
G to the linear transformations on V. 

Now, we’d like to interpret both the m x m determinant and the n x n permanent 
(n < m) as points in V, in order to phrase the permanent versus determinant 
problem in terms of orbit closures. For the determinant, this is trivial: Det,, is a 
degree-m homogeneous polynomial over the x;;’s. The n x n permanent, on the other 
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hand, is a lower-degree polynomial over a smaller set of variables. GCT solves that 
problem by considering the so-called padded permanent, 


Petin n (X) = 5 i Per, (X|n) ’ 
where X|,, denotes the top-left n x n submatrix of X, and xX, is just some entry of 
X that’s not in X|,,. This is a homogeneous polynomial of degree m. 
Let YDetm ‘= G- Det, be the orbit closure of the m x m determinant, and let 
Xrermn = G- Per* , be the orbit closure of the padded n x n permanent. I can now 


mn 


state the central conjecture that GCT, in its current incarnation, seeks to prove. 


(1) 


Conjecture 82 (Mulmuley-Sohoni Conjecture). [fm = 2” , then for all suffi- 
ciently large n we have Pet, € Xdetm, Or equivalently Xpermn CL Xdetm- In other 
words, the padded permanent is not in the orbit closure of the determinant. 


Mulmuley and Sohoni’s first major observation is that a proof of Conjecture 82 
(or indeed, any lower bound on m better than n2“°2”)) would imply a proof of 
Valiant’s Conjecture: 


Proposition 83 ([167]). Suppose there’s an affine embedding of the nxn permanent 
into the m x m determinant: i.e.,D(n) < m, in the notation of Sect. 6.5.1. Then 
Per, , © Aver 


Let me make two remarks about Proposition 83. First, for the proposition to 
hold, it’s crucial that we’re talking about the orbit closure, not just about the orbit. 
It’s easy to see that Per;,,, ¢ G- Det,,—for example, because every element of 
G - Det,, is irreducible (can’t be factored into lower-degree polynomials), whereas 
the padded permanent is clearly reducible. But that tells us only that there’s no 
invertible linear transformation of the variables that turns the determinant into the 
padded permanent, not that there’s no linear transformation at all. In GCT, linear 
changes of variable play the role of reductions—so, while the orbit of f plays the 
role of the “f-complete problems,” it’s the orbit closure that plays the role of the 


complexity class of functions reducible to f. 


More broadly, I’ve found, there are many confusing points about GCT whose resolutions require 
reminding ourselves that we’re talking about orbit closures, and not only about orbits. For example, 
the plan of GCT is ultimately to show, roughly speaking, that the n x n permanent has “too little 
symmetry” to be embedded into the m x m determinant, unless m is much larger than n. But in that 
case, what about (say) a random arithmetic formula f of size n, which has no nontrivial symmetries, 
but which clearly can be embedded into the (n + 1) x (n + 1) determinant, by Theorem 68? Even 
though this f clearly isn’t characterized by its symmetries, mustn’t the embedding obstructions 
for f be a strict superset of the embedding obstructions for the permanent—since f’s symmetries 
are a strict subset of the permanent’s symmetries—and doesn’t that give rise to a contradiction? 
According to Mulmuley (personal communication), the solution to the apparent paradox is that this 
argument would be valid if we were talking only about orbits, but it’s not valid for orbit closures. 
With orbit closures, the set of obstructions doesn’t depend in a simple way on the symmetries of 
the original function, so that it’s possible that an obstruction for the permanent would fail to be an 
obstruction for f. 
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The second remark is that, despite their similarity, it’s unknown whether 
Conjecture 82 is equivalent to Valiant’s conjecture that the permanent requires affine 
embeddings into the determinant of size D (n) > 2” The reason is that, as far as 
anyone knows, there might be points in the orbit closure of the determinant that 
aren’t in its “endomorphism orbit” (that is, the set of polynomials that have not- 
necessarily-invertible linear embeddings into the determinant) In complexity terms, 
these points would be homogeneous degree-m polynomials that can be arbitrarily 
well approximated by determinants of m x m matrices of linear functions, but not 
represented exactly. 

See Grochow [95] for further discussion of both issues. 


6.6.2 Characterization by Symmetries 


So far, it seems like all we’ve done is restated Valiant’s Conjecture in a more abstract 
language and slightly generalized it. But now we come to the central insight of GCT, 
which is that the permanent and determinant are both special, highly symmetric 
functions, and we can leverage that fact to learn more about their orbit closures than 
we could if they were arbitrary functions. For starters, Per (X) is symmetric under 
permuting X’s rows or columns, transposing X, and multiplying the rows or columns 
by scalars that multiply to 1. That is, we have 


Per (X) = Per (X’) = Per (PXQ) = Per (AXB) (2) 


for all permutation matrices P and Q, and all diagonal matrices A and B such that 
Per (A) Per (B) = 1. The determinant has an even larger symmetry group: we have 


Det (X) = Det (X") = Det (AXB) (3) 


for all matrices A and B such that Det (A) Det (B) = 1. 

But there’s a further point: it turns out that the permanent and determinant are 
both uniquely characterized (up to a constant factor) by their symmetries, among 
all homogeneous polynomials of the same degree. More precisely: 


Theorem 84. Let p be any degree-m homogeneous polynomial in the entries of 
X € C”™" that satisfies p(X) = p (PXQ) = p (AXB) for all permutation matrices 
P,Q and diagonal A, B with Per (A) Per (B) = 1. Then p(X) = a Per (X) for some 
a € C. Likewise, let p be any degree-m homogeneous polynomial in the entries of 
X € C”™" that satisfies p(X) = p(AXB) for all A,B with Det (A) Det (B) = 1. 
Then p (X) = a Det (X) for some a € C.°! 


6! Note that we don’t even need to assume the symmetry p(X) = p (x’); that comes as a free 
byproduct. Also, it might seem like “cheating” that we use the permanent to state the symmetries 
that characterize the permanent, and likewise for the determinant. But we’re just using the 
permanent and determinant as convenient ways to specify which matrices A, B we want, and could 
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Theorem 84 is fairly well-known in representation theory; see Grochow [95, 
Propositions 3.4.3 and 3.4.5] for an elementary proof, using Gaussian elimination 
for the determinant and even simpler considerations for the permanent. Notice that 
we’re not merely saying that any polynomial p with the same symmetry group as 
the permanent is a multiple of the permanent (and similarly for the determinant), 
but rather that any p whose symmetry group contains the permanent’s is a multiple 
of the permanent. 

In a sense, Theorem 84 is the linchpin of the entire GCT program. Among 
other things, it’s GCT’s answer to the question of how it will overcome the 
natural proofs barrier. For notice that, if we picked a degree-m homogeneous 
polynomial at random, it almost certainly wouldn’t be uniquely characterized by 
its symmetries, as the permanent and determinant are.°* Thus, if a proof that the 
permanent is hard relies on symmetry-characterization, we need not fear that the 
same proof would work for a random homogeneous polynomial, and thereby give 
us a way to break arithmetic pseudorandom functions (Sect. 6.5.3). While this isn’t 
mentioned as often, Theorem 84 should also let GCT overcome the relativization 
and algebrization barriers, since (for example) a polynomial that was #P“-complete 
for some oracle A, rather than #P-complete like the permanent was, would not have 
the same symmetries as the permanent itself. 


6.6.3 The Quest for Obstructions 


Because the permanent and determinant are characterized by their symmetries, and 
because they satisfy another technical property called “partial stability,’ Mulmuley 
and Sohoni observe that a field called geometric invariant theory can be used to get 
a handle on their orbit closures. I won’t explain the details of how this works (which 
involve something called Luna’s Etale Slice Theorem [150]), but will just state the 
punchline. 

Given a set S C CY, define R[S], or the coordinate ring of S, to be the vector 
space of all complex polynomials g : CY — C, with two polynomials identified if 
they agree on all points x € S. Then we’ll be interested in Rpet := R[YpDetm] and 


Reer = R [7X Perm.n]: the coordinate rings of the orbit closures of the determinant and 


tm—l\ : 
the padded permanent. In this case, N = ( ') is the dimension of the vector 


space of homogeneous degree-m polynomials over m” variables. So the coordinate 
rings are vector spaces of polynomials over N variables: truly enormous objects. 
Next, let g : CY’ — C be one of these “big” polynomials, whose inputs are 
the coefficients of a “small” polynomial p (such as the permanent or determinant). 
Then we can define an action of the general linear group, G = GL,,2 (C), on q, via 
(A: q) (p (x)) := q (p (Ax)) for all A € G. In other words, we take the action of G on 


give slightly more awkward symmetry conditions that avoided them. (This is especially clear for 
the permanent, since if A is diagonal, then Per (A) is just the product of the diagonal entries.) 


6 See Grochow [95, Proposition 3.4.9] for a simple proof of this, via a dimension argument. 
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the “small” polynomials p that we previously defined, and use it to induce an action 
on the “big” polynomials g. Notice that this action fixes the coordinate rings Rpet 
and Rpey (i.e., just shifts their points around), simply because the action of G fixes 
the orbit closures 7pet.m and XPerm.n themselves. As a consequence, the actions on G 
on Rpet and Rpe; give us two representations of the group G: that is, homomorphisms 
that map the elements of G to linear transformations on the vector spaces Rpet and 
Rper respectively. Call these representations ppet and pper respectively. 

Like most representations, Pper and pper can be decomposed uniquely into 
direct sums of irreducible representations, or “irreps” (which are not further 
decomposable). In particular, let p : G > C'** be any irrep of G. Then p occurs 
with some nonnegative integer multiplicity, call it Apet (Pp), iN Ppet, and with some 
possibly different multiplicity, call it Ape; (0), in Pper. We’re now ready for the 
theorem that sets the stage for the rest of GCT. 


Theorem 85 (Mulmuley-Sohoni [167]). Suppose there exists an irrep p such that 
Aper (0) > Apet (p). Then Per*, ¢ Xpetm: that is, the padded permanent is not in 


mn 
the orbit closure of the determinant. 


Note that Theorem 85 is not an “if and only if”: even if Per, ¢ Xdetm, there’s 
no result saying that the reason must be representation-theoretic. In GCT2 [168], 
Mulmuley and Sohoni conjecture that the algebraic geometry of Ypetym is in some 
sense completely determined by its representation theory, but if true, that would 
have to be for reasons rather specific to 7pet,m (or other “complexity-theoretic” orbit 
closures). 

If Aper(P) > Apet (pe), then Mulmuley and Sohoni call p an obstruction to 
embedding the permanent into the determinant. Any obstruction would be a witness 
to the permanent’s hardness: if one likes, it would prove that the m x m determinant 
has “too much symmetry” to express the padded n x n permanent, unless m is much 
larger than n. From this point forward, GCT is focused entirely on the hunt for such 
an obstruction. 

A priori, one could imagine proving nonconstructively that an obstruction p must 
exist, without actually finding the obstruction. However, Mulmuley and Sohoni 
emphatically reject that approach. They want not merely any proof of Conjecture 82, 
but an “explicit” proof: that is, one that yields an algorithm that actually finds an 
obstruction p witnessing Per;, , ¢ XDetm, in time polynomial in m and n. Alas, as you 
might have gathered, the representations ppe and ppe, are fearsomely complicated 
objects—so even if we accept for argument’s sake that obstructions should exist, we 
seem a very long way from algorithms to find them in less than astronomical time.°° 

For now, therefore, Mulmuley and Sohoni argue that the best way to make 
progress toward Conjecture 82 is to work on more and more efficient algorithms 


31 principle, pper and ppe, are infinite-dimensional representations, so an algorithm could search 
them forever for obstructions without halting. On the other hand, if we impose some upper bound 
on the degrees of the polynomials in the coordinate ring, we get an algorithm that takes “merely” 
doubly- or triply-exponential time. 
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to compute the multiplicities of irreps in complicated representations like ppe: and 
Pper- The hope is that, in order to design those algorithms, we’ll be forced to acquire 
such a deep understanding that we’ll then know exactly where to look for a p such 
that Ape (0) > Apet (Pe). So that’s the program that they’ve pursued for the last 
decade, for example in GCT 3, 4, 7, and 8 [45, 159, 160, 166]. 

The central idea here—that the path to proving P # NP will go through 
discovering new algorithms, rather than through ruling them out—is GCT’s version 
of “ironic complexity theory,” discussed in Sect. 6.4. What I’ve been calling “irony” 
in this survey, Mulmuley calls “The Flip” [161]: that is, flipping lower-bound 
problems into upper-bound problems, which we have a far better chance of solving. 

Stepping back from the specifics of the GCT program, Mulmuley’s view is that, 
before we prove (say) NP ¢ P/poly, a natural intermediate goal is to find an 
algorithm A that takes a positive integer n as input, runs for n?“) time (or even 
exp (n°) time), and then outputs a proof that 3SAT instances of size n have 
no circuits of size m, for some superpolynomial function m. Such an algorithm 
wouldn’t immediately prove NP ¢ P/poly, because we might still not know 
how to prove that A succeeded for every n. Even so, it would clearly be a titanic 
step forward, since we could run A and check that it did succeed for every n we 
chose, perhaps even for n’s in the billions. At that point, we could say either that 
NP ¢ P/poly, or else that NP C P/poly only “kicks in” at such large values of n as 
to have few or no practical consequences. Furthermore, Mulmuley argues, we’d then 
be in a much better position to prove NP ¢ P/poly outright, since we’d “merely” 
have to analyze A, whose very existence would obviously encode enormous insight 
about the problem, and prove that it worked for all n.© 

Mulmuley and others have indeed made progress in discovering efficient algo- 
rithms to the compute multiplicities of irreps, though the state-of-the-art is still 
extremely far from what Mulmuley conjectures is possible. To give an example, 
a Littlewood-Richardson coefficient is the multiplicity of a given irrep in a tensor 
product of two irreps of the general linear group GL,, (C). In GCT3 [166], Mulmuley 
et al. observed that a result of Knutson and Tao [132] implies that one can use 
linear programming, not necessarily to compute Littlewood-Richardson coefficients 
in polynomial time, but at least to decide whether they’re positive or zero.™ 
Biirgisser and Ikenmeyer [60] later gave a faster polynomial-time algorithm for 
the same problem; theirs was purely combinatorial and based on maximum flow. 
Note that, even if we only had efficient algorithms for positivity, those could still 
be useful for finding so-called occurrence obstructions: that is, irreps p such that 


4A very loose analogy: well before Andrew Wiles proved Fermat’s Last Theorem [233], that 
x" + y" = z” has no nontrivial integer solutions for any n > 3, number theorists knew a reasonably 
efficient algorithm that took an exponent n as input, and that (in practice, in all the cases that 
were tried) proved FLT for that n. Using that algorithm, in 1993—just before Wiles announced his 
proof—Buhler et al. [55] proved FLT for all n up to 4 million. 

6 As pointed out in Sect. 2.2.6, there are other cases in complexity theory where deciding positivity 
is much easier than exact counting: for example, deciding whether a graph has at least one perfect 
matching (counting the number of perfect matchings is #P-complete). 
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Aper (0) > 0 even though Ape: (p) = 0. Thus, the “best-case scenario” for GCT is 
that the permanent’s hardness can be witnessed not just by any obstructions, but by 
occurrence obstructions. 

In many cases, we don’t even know yet how to represent the multiplicity of 
an irrep as a #P function: at best, we can represent it as a difference between 
two #P functions. In those cases, the current research effort in GCT aims to give 
“positive formulas” for the multiplicities: in other words, to represent them as sums 
of exponentially many nonnegative terms, and thereby place their computation in #P 
itself. The hope is that a #P formula could be a first step toward a polynomial-time 
algorithm to decide positivity. To illustrate, Blasiak et al. achieved this for “two- 
row Kronecker coefficients” in GCT4 [45]. For other kinds of irreps, the quest for 
positive formulas has led to extremely deep waters, involving (for example) quantum 
groups and conjectured generalizations of the Riemann Hypothesis over finite fields. 
I won’t go into this here, but see GCT7 [159] and GCT8 [160] for this part of the 
story. 


6.6.4 GCT and P = NP 


Suppose—let’s dream—that everything above worked out perfectly. That is, sup- 
pose GCT led to the discovery of explicit obstructions for embedding the padded 
permanent into the determinant, and thence to a proof of Valiant’s Conjecture 66. 
How would GCT go even further, to prove P 4 NP? 

The short answer is that Mulmuley and Sohoni [167] defined an NP function 
called E, as well as a P-complete® function called H, and showed them to be 
characterized by symmetries in only a slightly weaker sense than the permanent and 
determinant are. The E and H functions aren’t nearly as natural as the permanent 
and determinant, but they suffice to show that P # NP could in principle be proven 
by finding explicit representation-theoretic obstructions, which in this case would 
be representations associated with the orbit of E but not with the orbit of H. Alas, 
because E and H are functions over a finite field F, rather than over C, the relevant 
algebraic geometry and representation theory would all be over finite fields as well. 
This leads to mathematical questions even less well-understood (!) than the ones 
discussed earlier, providing some additional support for the intuition that proving 
P # NP should be “even harder” than proving Valiant’s Conjecture. 

For illustration, let me now define Mulmuley and Sohoni’s E function. Let Xo 
and X; be two n x n matrices over the finite field F,. Also, given a binary string 
S = S1+++Sy, let X; be the m x n matrix obtained by choosing the ith column from Xo 
if s; = 0 or from X, if s; = 1 for alli € {1,...,m}. We then set 


66 A language L is called P-complete if (1) L € P, and (2) every L’ € P can be reduced to L by 
some form of reduction weaker than arbitrary polynomial-time ones (LOGSPACE reductions are 
often used for this purpose). 
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F (Xo,X\):= [] Det(X). 
se{0,1}” 


and finally set 
E (Xo, X1) = 1-— F (Xo, X1)4 


to obtain a function in {0,1}. Testing whether FE (Xo, X,) = 1 is clearly an NP 
problem, since an NP witness is a single s such that Det (X;) = 0. Furthermore, 
Gurvits [100] showed that, at least over Z, testing whether F (Xo, X,) = 0 is NP- 
complete. Interestingly, it’s not known whether E itself is NP-complete—though of 
course, to prove P 4 NP, it would suffice to put any NP problem outside P, not 
necessarily an NP-complete one. 

The main result about E (or rather F) is the following: 


Theorem 86 (see Grochow [95, Proposition 3.4.6]). Let p : Fe" — F, be any 
homogeneous, degree-n2” polynomial in the entries of Xo and X, that’s divisible by 
Det (Xo) Det (X1), and suppose every linear symmetry of F is also a linear symmetry 
of p. Then p (Xo, X1) = @F (Xo, X1) for some a € Fo. 


The proof of Theorem 86 involves some basic algebraic geometry. Even setting 
aside GCT, it would be interesting to know whether the existence of a plausibly-hard 
NP problem that’s characterized by its symmetries had direct complexity-theoretic 
applications. 

One last remark: for some complexity classes, such as BQP, we currently lack 
candidate problems characterized by their symmetries, so even by the speculative 
standards of this section, it’s unclear how GCT could be used to prove (say) P 4 
BQP or NP ¢ BQP. 


6.6.5 Reports from the Trenches 


Within the past few years, there’s been surprisingly rapid progress on connecting 
GCT to mainstream complexity theory; reproving known lower bounds using GCT 
or “GCT-like” methods; and investigating the truth or falsehood of some of GCT’s 
main hypotheses. Let me mention a few highlights. 

First, it’s now known that the central property of the permanent and determi- 
nant that GCT seeks to exploit—namely, their characterization by symmetries— 
does indeed have complexity-theoretic applications. In particular, Mulmuley [161] 
observed that one can use the symmetry-characterization of the permanent to give 
a different, and in many ways nicer, proof of the classic result of Lipton [146] that 
the permanent is self-testable: that is, given a circuit C that’s alleged to compute 
Per (X), in randomized polynomial time one can either verify that C (X) = Per (X) 
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for most matrices X, or else find a counterexample where C(X) 4 Per (X).° 
Subsequently, Kayal [124] took Mulmuley’s observation further by exploiting 
symmetry-characterization more heavily. Recall Kayal’s Theorem 80: that when 
d < /n, there’s a randomized polynomial-time algorithm to decide whether a 
given degree-d polynomial p (to which we have black-box access) is the permanent 
or determinant of a d x d matrix of affine functions jfj;, and if so, to find suitable 
Jij’s. Now, in the same paper [124], Kayal also showed that if g is an arbitrary 
polynomial, then deciding whether there exist A and b such that p (x) = q (Ax + b) 
is NP-hard. So what is it about the permanent and determinant that made the 
problem so much easier? The answer is their symmetry-characterization, along with 
more specific properties of the Lie algebras of the stabilizer groups of Per and 
Det. 

For more about the algorithmic consequences of symmetry-characterization, 
see for example Grochow [95, Chap. 4], who also partially derandomized Kayal’s 
algorithm and applied it to other problems such as matrix multiplication. In my 
view, an exciting challenge right now is to use the symmetry-characterization of 
the permanent, or perhaps of the E function from Sect. 6.6.4, to prove other new 
complexity results—not necessarily circuit lower bounds—that hopefully evade the 
relativization and algebrization barriers. 

In a second recent development, we now know that nontrivial circuit lower 
bounds—albeit, not state-of-the-art ones—can indeed be proved by finding 
representation-theoretic obstructions, just as GCT proposed. Recall from Sect. 6 
that the rank of a 3-dimensional tensor A € F”*’*" is the smallest r such that A 
can be written as the sum of r rank-one tensors, fj = x;yj;Zx. If F is a continuous 
field like C, then we can also define the border rank of A to be the smallest r 
such that A can be written as the limit of rank-r tensors. It’s known that border 
rank can be strictly less than rank.°* Border rank was introduced in 1980 by Bini 
et al. [44] to study matrix multiplication algorithms: indeed, one could call that the 
first appearance of orbit closures in computational complexity, decades before GCT. 

More concretely, the n x n matrix multiplication tensor, say over C, is defined to 
be the 3-dimensional tensor M,, € Cr xr xn whose (i,j) ,G,), (i,k) entries are all 
1, and whose remaining n® — n? entries are all 0. In 1973, Strassen [215] proved a 
key result connecting the rank of M,, to the complexity of matrix multiplication: 


Theorem 87 (Strassen [215]). The rank of M, equals the minimum number of 
nonscalar multiplications in any arithmetic circuit that multiplies two nxn matrices. 


67 This result of Lipton’s provided the germ of the proof that IP = PSPACE; see Sect. 6.3.1. 
Mulmuley’s test improves over Lipton’s by, for example, requiring only nonadaptive queries to 

C rather than adaptive ones. 

68 A standard example is the 2 x 2 x 2 tensor whose (2, 1, 1), (1,2, 1), and (1, 1, 2) entries are all 1, 

and whose 5 remaining entries are all 0. One can check that this tensor has a rank of 3 but border 

rank of 2. 
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As an immediate corollary, the border rank of M,, lower-bounds the arithmetic 
circuit complexity of n x n matrix multiplication. Bini [43] showed, moreover, 
that the exponent w of matrix multiplication is the same if calculated using rank 
or border rank—-so in that sense, the two are asymptotically equal. 

In a tour-de-force, in 2004 Landsberg [136] proved that the border rank of 
M) is 7, using non-GCT differential geometry methods. A corollary was that any 
procedure to multiply two 2x2 matrices requires at least 7 nonscalar multiplications, 
precisely matching the upper bound discovered by Strassen [214] in 1969. (The 
trivial upper bound is 8 multiplications.) 

I can now state what Biirgisser and Ikenmeyer [61] used GCT to achieve in 2013. 


Theorem 88 (Biirgisser and Ikenmeyer [61]). There are representation-theoretic 
(GCT) occurrence obstructions that witness that the border rank of M,, is at least 


32 
sn 2. 


By comparison, the state-of-the-art lower bound on the border rank of matrix 
multiplication, obtained using non-GCT methods, is the following. 


Theorem 89 (Landsberg and Ottaviani [138]). The border rank of M,, is at least 
2n? — n. 


Of course, since both lower bounds are still quadratic, neither of them shows that 
matrix multiplication requires more than ~ n? time, but it’s interesting to see how 
the best current GCT and non-GCT bounds compare. 

Meanwhile, another recent insight is that most known circuit lower bounds 
can be put into a “broadly GCT-like” format. In an elegant 2014 paper, Grochow 
[96] showed that the AC° and AC? [p] lower bounds of Sects. 6.2.3 and 6.2.4, the 
embedding lower bound of Mignon and Ressayre (Theorem 67), the lower bounds 
for small-depth arithmetic circuits and multilinear formulas of Sect. 6.5.2, and many 
other results can each be seen as constructing a separating module: that is, a 
“big polynomial” that takes as input the coefficients of an input polynomial, that 
vanishes for all polynomials in some complexity class C, but that doesn’t vanish 
for a polynomial g for which we’re proving that q ¢ C. Interestingly, the lower 
bounds that don’t fit into this format—such as MAgexp ¢ P/poly (Theorem 52) 
and NEXP ¢ ACC (Theorem 62)—essentially all use diagonalization as a key 
ingredient. Grochow’s result is an important unification, and it sounds like an 
impressive success for GCT. On the other hand, it’s important to understand that 
Grochow didn’t show that the known circuit lower bounds yield representation- 
theoretic obstructions: only that they yield separating modules. In other words, we 
might say, Grochow showed that the known circuit lower bounds fit into an abstract 
construal of the “GCT program,” but not that they fit into its really meaty part, into 
the part that makes GCT a program. 

A third development, just in 2015, is that we now have some initial results 
from actual searches for irreps that could serve as obstructions to embedding the 
permanent into the determinant. Unfortunately, not all the news is encouraging. 
For example, Ikenmeyer and Panova [109] have shown that “rectangular Kronecker 
coefficients” can’t be used to give occurrence obstructions separating Xperm. from 
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Xdetm- In other words, there’s a large class of irreps such that, if they occur at 
least once in the padded permanent representation ppe,, then they also occur at least 
once in the determinant representation ppet. Likewise, Biirgisser, Ikenmeyer, and 
Hiittenhain [62] have shown that any occurrence obstructions separating Xpermn 
from pet m must be “holes” of a certain monoid, further limiting where to look. 
It’s entirely possible that these negative results are artifacts of “looking under the 
lamp-post”—1.e., only at those irreps p for which one can actually calculate today 
whether Ape; (0) and Apet () are nonzero—but the worry is that irreps p such that 
Apdet (0) = 0 might be incredibly rare, in which case occurrence obstructions could 
be hard to find if they even exist. 

On the positive side, Ikenmeyer, Mulmuley, and Walter [108] have shown that 
there are superpolynomially many Kronecker coefficients that do vanish, thereby 
raising hope that occurrence obstructions might exist after all. Notably, they 
proved this result by first giving a #P formula for the relevant class of Kronecker 
coefficients, thereby illustrating the GCT strategy of first looking for algorithms and 
only later looking for the obstructions themselves. 


6.6.6 The Lessons of GCT 


Unsurprisingly, expert opinion is divided about GCT’s prospects. Some feel that 
GCT does little more than take complicated questions and make them even more 
complicated. Others feel it’s a perfectly natural and reasonable approach—maybe 
even the only extant approach that stands a chance—and that the complication is 
an inevitable byproduct of finally grappling with the real issues. Of course, one 
can also “cheer GCT from the sidelines” without feeling prepared to work on it 
oneself, particularly given the unclear prospects for any computer-science payoff in 
the foreseeable future. (Mulmuley once told me he thought it would take a hundred 
years until GCT led to major complexity class separations, and he’s the optimist!) 

Personally, I’d call myself a qualified fan of GCT, in much the same way and 
for the same reasons that I’m a qualified fan of string theory. I think all complexity 
theorists should learn about GCT—for one thing, because it has deep general lessons 
for the quest to prove P # NP, even if it ends up not succeeding, or not succeeding 
as Mulmuley envisioned. This section is devoted to what I believe those lessons are. 

A first lesson is that we can in principle evade the relativization, algebrization, 
and natural proofs barriers by using the existence of complete problems with 
special properties: as a beautiful example, the property of being “characterized by 
symmetries,’ which the permanent and determinant both enjoy. A second lesson is 
that “ironic complexity theory” has even further reach than one might have thought: 
one could use the existence of surprisingly fast algorithms, not merely to show that 
certain complexity collapses would violate hierarchy theorems, but also to help 
find certificates that problems are hard. A third lesson is that there’s at least one 
comprehensible route by which a circuit lower bound proof would need to know 
about huge swathes of “traditional, continuous” mathematics, as many computer 
scientists have long suspected (or feared!). 
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But none of those lessons really gets at how GCT changed my own thinking about 


P = NP. One of the most striking features of GCT is that, even as the approach 
stands today, it “knows” about various nontrivial problems in P, such as maximum 
flow and linear programming (because they’re involved in deciding whether the 
multiplicities of irreps are nonzero). We knew, of course, that any proof of P 4 NP 
would need to “know” that linear programming, matching, and so on are in P and are 
therefore different from 3SAT. (Indeed, that’s one way to express the main difficulty 


of the P = NP problem.) So the fact that GCT knows about all these polynomial- 
time algorithms seems both impressive and reassuring. But what’s strange is that 
GCT seems to know the upper bounds, not the lower bounds—the power of the 
algorithms, but not their limitations! In other words, consider a hypothetical proof 
of P # NP using GCT. If we ignore the details, and look from a distance of a 
thousand miles, the proof seems to be telling us not: ““You see how weak P is? You 
see all these problems it can’t solve?” but rather, “You see how strong P is? You 
see all these amazing, nontrivial problems it can solve?” The proof would seem to 
building up an impressive case for the wrong side of the argument! 

One response would be to point out that this is math, not a political debate, and 
leave it at that. But perhaps one can do better. Let A be a hypothetical problem, like 
matching or linear programming, that’s in P for a nontrivial reason, and that’s also 
definable purely in terms of its symmetries, as the permanent and determinant are. 
Then we can define an orbit closure y4, which captures all problems reducible to A. 
By assumption, ¥4 must be contained in yp, the orbit closure corresponding to a 
P-complete problem, such as Mulmuley and Sohoni’s H function (see Sect. 6.6.4). 
And hence, there must not be any representation-theoretic obstruction to such a 
containment. In other words, if we were to compute the multiplicities m,,m2,... of 
all the irreps in the representation associated with y,4, as well as the multiplicities 
n1,N2,... of the irreps associated with yp, we’d necessarily find that m; < n; for 
all i. 

Furthermore, by the general philosophy of GCT, once we had our long-sought 
positive formulas for these multiplicities (based on nonstandard quantum groups, 
the Riemann hypothesis, silly putty, herbal shampoo, or whatever), we might well 
be able to use those formulas to prove the above inequalities. 

Now let’s further conjecture—following GCT2 [168]—that the orbit closures 74 
and yp are completely captured by their representation-theoretic data. In that case, 
by showing that there’s no representation-theoretic obstruction to 74 C yp, we 
would have proved, nonconstructively, that there exists a polynomial-time algorithm 
for A! And for that reason, we shouldn’t be surprised if the algorithmic techniques 
that are used to solve A (matching, linear programming, or whatever) have already 
implicitly appeared in getting to this point. Indeed, we should be worried if they 
didn’t appear.°° 


It would be interesting to find a function in P, more natural than the P-complete H function, 
that’s completely characterized by its symmetries, and then try to understand explicitly why there’s 
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More broadly, Mulmuley has repeatedly stressed that P # NP is a “universal 
mathematical statement”: it says there’s no polynomial-time algorithm for 3SAT, 
no matter which area of math we tried to use to construct such an algorithm. And 
therefore, after we find that an easy “opening gambit’—like the diagonalization 
used to proved the unsolvability of the halting problem—doesn’t work, we shouldn’t 
be shocked if nearly every area of math ends up playing some role in the proof. 

Now, a natural reaction to this observation would be, not awe at the profundity of 


P= NP, but rather complete despair. Since math is infinite, and since the possible 
“ideas for polynomial-time algorithms” are presumably unbounded, why doesn’t 


the “universality” of P = NP mean that the task of proving could go on forever? 
At what point, according to the GCT philosophy, can we ever say “enough! we’ve 
discovered enough polynomial-time algorithms; now we’re ready to flip things 
around and proceed to proving P 4 NP”? 

The point I want to make is that the earlier considerations about y, and yp 
immediately suggest an answer to this question. Namely, we’re ready to stop when 
we’ve discovered nontrivial polynomial-time algorithms, not for all problems in 
P, but for all problems in P that are characterized by their symmetries. For let B 
be a problem in P that isn’t characterized by symmetries. Then the orbit closure 
Xz is contained in yp, and if we could prove that 7g C Xp, then we would’ve 
nonconstructively shown the existence of a polynomial-time algorithm for B. But 
our hypothetical P # NP proof doesn’t need to know about that. For since B isn’t 
characterized by symmetries, the GCT arguments aren’t going to be able to prove 
Xe © Xp anyway. 

The above would seem to motivate a thorough investigation of which functions 
in P (or NC’, etc.) can be characterized by their symmetries. If GCT is going to 
work at all, then the set of such functions, while presumably infinite, ought to be 
classifiable into a finite number of families. The speculation suggests itself that these 
families might roughly correspond to the different algorithmic techniques: Gaussian 
elimination, matching, linear programming, polynomial factorization, etc., and of 
course whatever other techniques haven’t yet been discovered. As a concrete first 
step toward these lofty visions, it would be interesting to find some example of a 
function in P that’s characterized by its symmetries, like the determinant is, and 
that’s in P only because of the existence of nontrivial polynomial-time algorithms 
for (say) matching or linear programming. 


6.6.7 Reservations 


The reader might have gotten the impression, from the last section, that I think GCT 


. 
is the most important advance on P = NP ever made. In this section I’ll set out a 
few of my reservations. 


no representation-theoretic obstruction to that function’s orbit closure being contained in yp— 
something we already know must be true. 
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First, it remains unclear whether we’ll be able to use representation theory, in 
the foreseeable future, not merely to prove Valiant’s Conjecture or P # NP, but 
to prove any new lower bound: for example, EXP ¢ ACC, or the permanent 
requiring arithmetic formulas of size Q (n*), or really anything that would convince 
skeptics of GCT’s power. I regard this as possible, and Grochow’s demonstration 
[96] that most existing circuit lower bounds can be phrased in vaguely GCT-like 
ways is evidence for it. On the other hand, if we want “full GCT’—meaning not just 
separating modules, but specifically representation-theoretic obstructions—then it’s 
also possible that the program is so challenging that getting it to work for anything 
is as hard as getting it to work for Valiant’s Conjecture. In practice, I predict that 
there will be a rush among theoretical computer scientists to learn GCT when, and 
only when, it’s used to prove some impressive new lower bound that had foiled other 
approaches. 

Second, the story of the border rank of the matrix multiplication tensor, set 
out in Sect.6.6.5, gives me pause, because it raises the possibility that even if 
representation-theoretic obstructions do exist, proving their existence will be even 
harder than proving complexity class separations in some more direct way. One 
possible “failure mode” for GCT is that, after centuries of struggle, mathematicians 
and computer scientists finally prove Valiant’s Conjecture and P # NP—and then, 
after further centuries of struggle, it’s shown that GCT could’ve proven these results 
as well (but only with quantitatively weaker bounds). 

Third, there are major aspects of complexity theory that GCT seems not to 
capture. For example, we saw in Sect. 6.6.5 that diagonalization—which despite 
reports of its demise, has reliably shown up in one lower bound after another 
over the decades, from the Time Hierarchy Theorem to time-space tradeoffs to 
NEXP ¢ ACC—seems to elude the GCT framework. A related point is that GCT, 
as it stands, has no way to take advantage of uniformity: for example, no way to 
prove P # NP, without also proving the stronger result NP ¢ P/poly. However, 
given that we can prove P # EXP but can’t even prove NEXP ¢ TC”, it seems 
conceivable that uniformity could help in proving P 4 NP. 

But perhaps my most important reservation is with a central argument that 
Mulmuley has offered for GCT in recent years [161, 163, 164]. His argument is 


that, even if GCT isn’t literally the only way forward on P = NP, still, the choice of 
GCT to go after explicit obstructions is in some sense provably unavoidable—and 
furthermore, GCT is the “simplest” approach to finding the explicit obstructions, 
so Occam’s Razor all but forces us to try GCT first. I completely agree that GCT 
represents a natural attack plan. It’s something to try. But I don’t think we have 
any theorem that can support the interpretation that GCT’s choices are inevitable, or 
“basically” inevitable. And I can easily envision that progress will come by freely 
“mixing and matching” GCT ideas with non-GCT ones. 

In more detail, we can identify at least four major successive decisions that GCT 
makes: 


(1) To prove P ¥ NP, we should start by proving Valiant’s Conjecture 66. 
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(2) To prove Valiant’s Conjecture, the natural approach is to prove Conjecture 82, 
about orbit closures. 

(3) To prove Conjecture 82, the natural approach is to find explicit representation- 
theoretic embedding obstructions. 

(4) To find those obstructions, we should start by finding faster algorithms (or 
algorithms in lower complexity classes) to learn about the multiplicities of 
irreps. 


All four decisions are reasonable, but not one is obvious. And of course, even 
if every proposition in a list has high probability individually (or high probability 
conditioned on its predecessors), their conjunction could have probability close to 
zero! 

As we saw in Sect.6.5, decision (1) predates GCT by decades, so there’s no 
need to revisit it here. Meanwhile, decision (2) seems to involve only a small 
strengthening of what needs to be proved, in return for a large gain in elegance. 
But there’s plenty to question about decisions (3) and (4). 

Regarding (3): we saw, in Sect. 6.6.5, that even if a given embedding of orbit 
closures is impossible, the reason might simply not be reflected in representation 
theory—and even if it is, it might be Harder to prove that there’s a representation- 
theoretic obstruction than that there’s some other obstruction, and one might get 
only a weaker lower bound that way. At least, that’s what seems to be true so far with 
the border rank of the matrix multiplication tensor. This would especially be an issue 
if it turned out that occurrence obstructions were rare or nonexistent, so that finding 
representation-theoretic obstructions would require counting the multiplicities of 
irreps and comparing them (rather than just checking whether the multiplicities are 
zero). 

But let me concentrate on (4). Is it clear that we must, in Mulmuley’s words, “go 
for explicitness”: that is, look for an efficient algorithm that takes a specific n and m 
as input, and tries to find a witness that it’s impossible to embed the padded n x n 
permanent into the m xm determinant? Why not just look directly for a proof, which 
(if we found it) would work for arbitrary n and m = n?)? 

Mulmuley’s argument for explicitness rests on what he calls the “flip theorems” 
[161]. These theorems, in his interpretation, assert that any successful approach to 
circuit lower bounds (not just GCT) will yield explicit obstructions as a byproduct. 
And thus, all GCT is doing is bringing into the open what any proof of Valiant’s 
Conjecture or NP ¢ P/poly will eventually need to confront anyway. 

Let me now state some of the flip theorems. First, building on a 1996 learning 
algorithm of Bshouty et al. [54], in 2003 Fortnow, Pavan, and Sengupta showed that, 
if NP-complete problems are hard at all, then there must be short lists of instances 
that cause all small circuits to fail. 


Theorem 90 (Fortnow et al. [82]). Suppose NP ¢ P/poly. Then for every n and 
k, there’s a list of 3SAT instances ,...,¢¢, of length at most € = n°, such 
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that every circuit C of size at most n* fails to decide at least one 9; in the list. 
Furthermore, such a list can be found in the class BPPNP 70 


Atserias [31] showed that, in the statement of Theorem 90, we can also swap the 
NP oracle for an oracle for the circuit C itself: in other words, the list g),..., @~ can 
also be found in the class BPP. 

Likewise, if the permanent is hard, then there must be short lists of matrices that 
cause all small arithmetic circuits to fail—and here the lists are much easier to find 
than they are in the Boolean case. 


Theorem 91 (Mulmuley [161, 163]). Suppose Valiant’s Conjecture 66 holds (i.e., 
the permanent has no polynomial-size arithmetic circuits, over finite fields F with 
|F| >> n). Then for every n and k, there’s a list of matrices A,,...,A¢ € F"*", of 
length at most £ = n°), such that for every arithmetic circuit C of size at most 
nk, there exists an i such that C (A;) # Per (A;). Indeed, a random list A,,...,Ae 
will have that property with 1 — o(1) probability. Furthermore, if polynomial 
identity testing has a black-box derandomization,’! then such a list can be found 
in deterministic n° time. 


While not hard to prove, Theorems 90 and 91 are conceptually interesting: they 
show that the entire hardness of 3SAT and of the permanent can be “concentrated” 
into a small number of instances. My difficulty is that this sort of “explicit obstruc- 
tion” to computing 3SAT or the permanent, seems barely related to the sorts of 
explicit obstructions that GCT is seeking. The obstructions of Theorems 90 and 91 
aren’t representation-theoretic; they’re simply lists of hard instances. Furthermore, 
a list like g,...,@¢ or Aj,...,Ag is not an easy-to-verify witness that 3SAT or the 
permanent is hard, because we’d still need to check that the list worked against all 
of the exponentially many n?“)-sized circuits. Having such a list reduces a two- 
quantifier (outs ) problem to a one-quantifier (NP) problem, but it still doesn’t put the 
problem in P—and we have no result saying that if, for example, the permanent is 
hard, then there must be obstructions that can be verified in n°“ time. Perhaps the 
best we can say is that, if we proved the permanent was hard, then we’d immediately 
get, for every n, an “obstruction” that could be both found and verified in 0 time 
steps! But for all we know, the complexity of finding provable obstructions could 
jump from exp (n°) to O as our understanding improved, without ever passing 
through n?), 

Thus, I find, GCT’s suggestion to look for faster obstruction-finding (or 
obstruction-recognizing) algorithms is a useful guide, a heuristic, a way to organize 


7Tn fact, the list can be found in the class ZPP™?, where ZPP stands for Zero-Error Probabilistic 
Polynomial-Time. This means that, whenever the randomized algorithm succeeds in constructing 
the list, it’s certain that it’s done so. 

“The polynomial identity testing problem was defined in Sect.5.4. Also, by a “black-box 
derandomization,” we mean a deterministic polynomial-time algorithm that outputs a hitting 
set: that is, a list of points x,,..., xe such that, for all small arithmetic circuits C that don’t 
compute the identically-zero polynomial, there exists an i such that C (x;) 4 0. What makes the 
derandomization “black-box” is that the choice of x,,..., xe doesn’t depend on C. 
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our thoughts about how on earth we’re going to find, say, an irrep p that blocks the 
embedding of the permanent into the determinant. But it’s not the only possible way 
forward. 


7 Conclusions 


Some will say that this survey’s very length, the bewildering zoo of approaches and 
variations and results and barriers that it covered, is a sign that no one has any real 


clue about the P = NP problem—or at least, that J don’t. Among those who think 
that, perhaps someone will write a shorter survey that points unambiguously to the 
right way forward! 

But it’s also possible that the business of proving brute-force search unavoidable 
seems complicated because it is complicated—because the reasons why brute-force 
search can be avoided in specific cases are as diverse as mathematics itself, and 
because here (unlike, say, for the halting problem), an argument ruling out all those 
reasons must be similarly deep and wide-ranging. I confess to limited sympathy for 
the idea that someone will just set aside everything that’s already known, think hard 
about the structural properties of the sets of languages accepted by deterministic and 
nondeterministic polynomial-time Turing machines, and find a property that holds 
for one set but not the other, thereby proving P 4 NP. For I keep coming back to 
the question: if a hands-off, aprioristic approach sufficed for P 4 NP, then why did 
it apparently not suffice for all the weaker separations that we’ve surveyed here? 

At the same time, I hope our tour of the progress in lower bounds has made 


the case that there’s no reason (yet!) to elevate P = NP to some plane of 
metaphysical unanswerability, or assume it to be independent of the axioms of 
set theory, or anything like that. The experience of complexity theory, including 
the superpolynomial lower bounds that people did manage to prove after struggle 


and effort, is consistent with P = NP being “merely a math problem’—albeit, a 
math problem that happens to be well beyond the current abilities of civilization, 
much like the solvability of the quintic in the 1500s, or Fermat’s Last Theorem 
in the 1700s. When we’re faced with such a problem, a natural response is to 
want to deepen our understanding of the entire subject (in this case, algorithms and 
computation) surrounding the problem—not merely because that’s a prerequisite to 
someday capturing the famous beast, but because regardless, the new knowledge 
gained along the way will hopefully find uses elsewhere. In our case, modern 
cryptography, quantum computing, and parts of machine learning could all be seen 


as flowers that bloomed in the garden of P = NP. 

Obviously I don’t know how P ¥ NP will ultimately be proved—if I did, this 
would be a very different survey! It seems plausible that a successful approach 
would yield the stronger result NP ¢ P/poly (i.e., that it wouldn’t take advantage 
of uniformity); that it would start by proving Valiant’s Conjecture (i.e., that the 
algebraic case would precede the Boolean one); and that it would draw on many 
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core areas of mathematics, but none of these things are even close to certain. The 
only prediction I feel confident in making is that the idea of “ironic complexity 
theory”—1.e., of a profound duality between upper and lower bounds, where the way 
to prove that there’s no fast algorithm for problem A is to discover fast algorithms 
for problems B, C, and D—is here to stay. As we saw, ironic complexity theory 
is at the core of the GCT program, but it’s also at the core of Williams’s proof of 
NEXP ¢ ACC, which in other respects is about as far from GCT as possible. The 
natural proofs barrier also provides a sort of contrapositive to ironic complexity, 
showing how the nonexistence of efficient algorithms is often what prevents us from 
proving lower bounds. If 3SAT is ever placed outside P, I’m willing to bet that the 
proof will place many other problems inside P—or at any rate, in smaller complexity 
classes than previously known. 

So, if we take an optimistic attitude (optimistic about proving intractability!), 
then which breakthroughs should we seek next? What’s on the current horizon? 
There are hundreds of possible answers to that question—we’ve already encoun- 
tered some in this survey—but if I had to highlight a few: 


¢ Prove lower bounds against nonuniform TC =r example, by finding a better- 
than-brute-force algorithm for the neural network satisfiability problem.’ 

* Prove a lower bound better than n!4/2! on the rank of an explicit d-dimensional 
tensor, or construct one of the many other algebraic or combinatorial objects 
(rigid matrices, elusive functions, etc.) that are known to imply new circuit lower 
bounds. 

e Advance proof complexity to the point where we could, for example, prove a 
superpolynomial lower bound on the number of steps needed to convert some 
n-input, polynomial-size Boolean circuit C into some equivalent circuit C’, 
via moves that each swap out a size-O (1) subcircuit for a different size-O (1) 
subcircuit with identical input/output behavior.”* 

¢ Prove a superpolynomial lower bound on the number of 2-qubit quantum 
gates needed to implement some explicit n-qubit unitary transformation U. 
(Remarkably, as far as anyone knows today, one could succeed at this without 
needing to prove any new classical circuit lower bound. On the other hand, it’s 
plausible that one would need to overcome a unitary version of the natural proofs 
barrier.) 


In 1999, Allender [19] showed that the permanent, and various other natural #P-complete 
problems, can’t be solved by LOGTIME-uniform TC° circuits: in other words, constant-depth 
threshold circuits for which there’s an O (logn)-time algorithm to output the ith bit of their 
description, for any i. Indeed, these problems can’t be solved by LOGTIME-uniform TC? circuits 
of size f (n), where f is any function that can yield an exponential when iterated a constant number 
of times. The proof uses a hierarchy theorem, it would be interesting to know whether it relativizes. 
Examples are deleting two successive NOT gates, or applying de Morgan’s laws. By the 
completeness of Boolean algebra, one can give local transformation rules that suffice to convert 
any n-input Boolean circuit into any equivalent circuit using at most exp (7) moves. 

From talking to experts, this problem seems closely related to the problem of proving 
superpolynomial lower bounds for so-called Frege proofs, but is possibly easier. 
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¢ Clarify whether ACC has a natural proofs barrier, and prove ACC lower bounds 
for problems in EXP or below. 

¢ Clarify whether there’s an arithmetic natural proofs barrier, or something else 
preventing us from crossing the “chasm at depth three.” 

¢ Prove any lower bound on D(n), the determinantal complexity of the n x n 
permanent, better than Mignon and Ressayre’s Q (n?/2). 

¢ Derandomize polynomial identity testing—or failing that, prove some derandom- 
ization theorem that implies a strong new circuit lower bound. 

¢ Discover a new class of polynomial-time algorithms, especially but not only for 
computing the multiplicities of irreps. 

¢ Pinpoint additional special properties of 3SAT or the permanent that could be 
used to evade the natural proofs barrier, besides the few that are already known, 
such as symmetry-characterization and the ability to simulate machines in a 
hierarchy theorem. 

e Prove any interesting new lower bound by finding a representation-theoretic 
obstruction to an embedding of orbit closures. 

¢ Perhaps my favorite challenge: find new, semantically-interesting ways to “hob- 
ble” the classes of polynomial-time algorithms and polynomial-size circuits, 
besides the ways that have already been studied, such as restricted mem- 
ory, restricted circuit depth, monotone gates only, arithmetic operations only, 
and restricted families of algorithms (such as DPLL and certain linear and 
semidefinite programming relaxations). Any such restriction that one discovers 
is effectively a new slope that one can try to ascend up the P # NP mountain, 
possibly gentler than the previous slopes. 


Going even further on a limb, P’ve long wondered whether massive computer 
search could give any insight into complexity lower bound questions, beyond the 
relatively incidental ways it’s been used for this purpose already (e.g., [237]). For 
example, could we feasibly discover the smallest arithmetic circuits to compute 
the permanents of 4 x 4 and 5 x 5 and 6 x 6 matrices? And if we did, would 
examination of those circuits yield any clues about what to prove for general n? 
The conventional wisdom has always been “no” to both questions. For firstly, as far 
as anyone knows today, the computational complexity of such a search will grow 
not merely exponentially but doubly exponentially with n (assuming the optimal 
circuits that we’re trying to find grow exponentially themselves), and examination 
of the constants suggests that n = 4 might already be out of reach. And secondly, 
even if we knew the optimal circuits, they’d tell us nothing about the existence or 
nonexistence of clever algorithms that start to win only at much larger values of n. 
After all, many of the theoretically efficient algorithms that we know today only 
overtake the naive algorithms for n’s in the thousands or millions.” 


“There are even what Richard Lipton has termed “galactic algorithms” [147], which beat their 
asymptotically-worse competitors, but only for values of n that likely exceed the information stor- 
age capacity of the galaxy or the observable universe. The currently-fastest matrix multiplication 
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These arguments have force. Even so, I think we should be open to the possibility 
that someday, advances in raw computer power, and especially theoretical advances 
that decrease the effective size of the search space, might change the calculus and 
open up the field of “experimental complexity theory.” One way that could happen 
would be if breakthroughs in GCT, or some other such program, brought the quest 
for “explicit obstructions” (say, to the 1000 x 1000 permanent having size-107° 
circuits) within the range of computer search, if still not within the range of the 
human mind. 

Or of course, the next great advance might come from some direction that I didn’t 
even mention here, whether because no one grasps its importance yet or simply 
because 7 don’t. But regardless of what happens, the best fate this survey could 
possibly enjoy would be to contribute, in some tiny way, to making itself outdated. 


Appendix: Glossary of Complexity Classes 


To help you remember all the supporting characters in the ongoing soap opera 
of which P and NP are the stars, this appendix contains short definitions of the 
complexity classes that appear in this survey, with references to the sections where 
the classes are discussed in more detail. For a fuller list, containing over 500 classes, 
see for example my Complexity Zoo [8]. All the classes below are classes of 
decision problems—that is, languages L C {0, 1}*. The known inclusion relations 
among many of the classes are also depicted in Fig. 3. 


algorithms might fall into this category, although the constants don’t seem to be known in enough 
detail to say for sure. 
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Il : coNP?, the second level of the polynomial hierarchy (with universal 
quantifier in front). See Sect. 2.2.3. 

ae : NPP the second level of the polynomial hierarchy (with existential quanti- 
fier in front). See Sect. 2.2.3. 

AC°: The class decidable by a nonuniform family of polynomial-size, constant- 
depth, unbounded-fanin circuits of AND, OR, and NOT gates. See Sect. 6.2.3. 
AC® [m]: | AC° enhanced by MOD m gates, for some specific value of m (the case 
of m a prime power versus a non-prime-power are dramatically different). See 

Sect. 6.2.4. 

ACC: AC° enhanced by MOD m gates, for every m simultaneously. See 
Sect. 6.4.2. 

BPP: Bounded-Error Probabilistic Polynomial-Time. The class decidable by a 
polynomial-time randomized algorithm that errs with probability at most 1/3 on 
each input. See Sect. 5.4.1. 

BQP: Bounded-Error Quantum Polynomial-Time. The same as BPP except that 
we now allow quantum algorithms. See Sect. 5.5. 

coNP: The class consisting of the complements of all languages in NP. Complete 
problems include unsatisfiability, graph non-3-colorability, etc. See Sect. 2.2.3. 

DTISP (f (n),g(n)): See Sect. 6.4.1. 


EXP: Exponential-Time, or (J, TIME (2"). Note the permissive definition of 
“exponential,” which allows any polynomial in the exponent. See Sect. 2.2.7. 
EXPSPACE: Exponential-Space, or |), SPACE (2F), See Sect. 2.2.7. 


IP: Interactive Proofs, the class for which a “yes” answer can be proven (to 
statistical certainty) via an interactive protocol in which a polynomial-time 
verifier Arthur exchanges a polynomial number of bits with a computationally- 
unbounded prover Merlin. Turns out to equal PSPACE [200]. See Sect. 6.3.1. 

LOGSPACE: Logarithmic-Space, or SPACE (logn). Note that only read/write 
memory is restricted to O (log) bits; the n-bit input itself is stored in a read- 
only memory. See Sect. 6.4.1. 

MA:  Merlin-Arthur, the class for which a “yes” answer can be proven to statistical 
certainty via a polynomial-size message from a prover (“Merlin”), which the 
verifier (“Arthur”) then verifies in probabilistic polynomial time. Same as NP 
except that the verification can be probabilistic. See Sect. 6.3.2. 

MAexp: The exponential-time analogue of MA, where now Merlin’s proof can be 
20 bits long, and Arthur’s probabilistic verification can also take 2 time. 
See Sect. 6.3.2. 

NC!: The class decidable by a nonuniform family of polynomial-size Boolean 
formulas—or equivalently, polynomial-size Boolean circuits of fanin 2 and depth 
O (logn). The subclass of P/poly that is “highly parallelizable.” See Sect. 5.2. 


NEXP: Nondeterministic Exponential-Time, or ), NTIME (2"') . The exponential- 
time analogue of NP. See Sect. 2.2.7. 
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NP: Nondeterministic Polynomial-Time, or J, NTIME (n‘). The class for which 
a “yes” answer can be proven via a polynomial-size witness, which is verified by 
a deterministic polynomial-time algorithm. See Sect. 2. 

NTIME (f (n)): | Nondeterministic f (n)-Time. The class for which a “‘yes” answer 
can be proven via an O (f (n))-bit witness, which is verified by a deterministic 
O (f (n))-time algorithm. Equivalently, the class solvable by a nondeterministic 
O (f (n))-time algorithm. See Sect. 2.2.7. 

P: Polynomial-Time, or J, TIME (n*). The class solvable by a deterministic 
polynomial-time algorithm. See Sect. 2. 

p#P: P with an oracle for #P problems (i.e., for counting the exact number of 
accepting witnesses for any problem in NP). See Sect. 2.2.6. 

P/poly: P enhanced by polynomial-size “advice strings” {a,},, which depend 
only on the input size n but can otherwise be chosen to help the algorithm as 
much as possible. Equivalently, the class solvable by a nonuniform family of 
polynomial-size Boolean circuits (i.e., a different circuit is allowed for each input 
size n). See Sect. 5.2. 

PH: The Polynomial Hierarchy. The class expressible via a polynomial-time 
predicate with a constant number of alternating universal and existential quan- 
tifiers over polynomial-size strings. Equivalently, the union of pai = NP, 
uke = coNP, = = NPP, me = coNP™? and so on. See Sect. 2.2.3. 

PP: Probabilistic Polynomial-Time. The class decidable by a polynomial-time 
randomized algorithm that, for each input x, guesses the correct answer with 
probability greater than 1/2. Like BPP but without the bounded-error (1/3 
versus 2/3) requirement, and accordingly believed to be much more powerful. 
See Sect. 2.2.6. 

PSPACE: Polynomial-Space, or ), SPACE (n“). See Sect. 2.2.5. 

SPACE (f (n)): The class decidable by a serial, deterministic algorithm that uses 
O (f (n)) bits of memory (and possibly up to 2°/™ time). See Sect. 2.2.7. 

TC°: AC° enhanced by MAJORITY gates. Also corresponds to “neural networks” 
(polynomial-size, constant-depth circuits of threshold gates). See Sect. 6.2.5. 
TIME (f (n)): The class decidable by a serial, deterministic algorithm that uses 
O (f (n)) time steps (and therefore, O (f (n)) bits of memory). See Sect. 2.2.7. 
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From Quantum Systems to L-Functions: 
Pair Correlation Statistics and Beyond 


Owen Barrett, Frank W. K. Firk, Steven J. Miller, 
and Caroline Turnage-Butterbaugh 


Abstract The discovery of connections between the distribution of energy levels 
of heavy nuclei and spacings between prime numbers has been one of the most 
surprising and fruitful observations in the twentieth century. The connection 
between the two areas was first observed through Montgomery’s work on the 
pair correlation of zeros of the Riemann zeta function. As its generalizations and 
consequences have motivated much of the following work, and to this day remains 
one of the most important outstanding conjectures in the field, it occupies a central 
role in our discussion below. We describe some of the many techniques and results 
from the past sixty years, especially the important roles played by numerical and 
experimental investigations, that led to the discovery of the connections and progress 
towards understanding the behaviors. In our survey of these two areas, we describe 
the common mathematics that explains the remarkable universality. We conclude 
with some thoughts on what might lie ahead in the pair correlation of zeros of 
the zeta function, and other similar quantities. 


1 Introduction 


Montgomery’s pair correlation conjecture posits that zeros of L-functions behave 
similarly to energy levels of heavy nuclei. The bridge between these fields is random 
matrix theory, a beautiful subject which has successfully modeled a large variety 
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of diverse phenomena (see [7, 87] for a great example of how varied the systems 
can be). It is impossible in a short chapter to cover all the topics and connections; 
fortunately there is no need as there is an extensive literature. Our goal is therefore 
to briefly describe the history of the subject and the correspondences, concentrating 
on some of the main objects of interest and past successes, ending with a brief tour 
through a subset of current work and a discussion of some of the open questions 
in mathematics. We are deliberately brief in areas that are well known or are 
extensively covered in the literature, and instead dwell at greater lengths on the 
inspiration from and interpretation through physics (see for example Sect. 2.6), as 
these parts of the story are not as well known but deserve to be (both for historical 
reasons as well as the guidance they can offer). 

To this end, we begin with a short introduction to random matrix theory and a 
quick description of the main characters studied in this chapter. We then continue 
in Sect. 2 with a detailed exposition of the historical development of random matrix 
theory in nuclear physics in the 1950s and 1960s. We note the pivotal role played 
by the nuclear physics experimentalists in gathering data to support the theoretical 
conjectures; we will see analogues of these when we get to the work in the 
1970s and 1980s on zeros of L-functions in Sect.3.5. One of our main purposes 
is in fact to highlight the power of experimental data, be it data from a lab or a 
computer calculation, and show how attempts to explain such results influence the 
development and direction of subjects. We then shift emphasis to number theory 
in Sect. 3, and see how studies on the class number problem led Montgomery to 
his famous pair correlation conjecture for the zeros of the Riemann zeta function. 
This and related statistics are the focus of the rest of the chapter; we describe 
what they are, what progress has been made (theoretically and numerically), and 
then turn to some open questions. Most of these open questions involve how the 
arithmetic of Z-functions influences the behavior; remarkably the main terms in a 
variety of problems are independent of the finer properties of Z-functions, and it is 
only in lower order terms (or, equivalently, in the rates of convergence to the random 
matrix theory behavior) that the dependencies on these properties surface. We then 
conclude in Sect. 4 with current questions and some future trends. 


1.1 The Early Days: Statistics and Biometrics 


Though our main characters will be energy levels of nuclei and zeros of L-functions, 
the story of random matrix theory begins neither with physics nor with mathematics, 
but with statistics and biometrics. In 1928 John Wishart published an article 
titled The Generalised Product Moment Distribution in Samples from a Normal 
Multivariate [150] in Biometrika (see [149] for a history of the journal, which we 
briefly recap). The journal was founded at the start of the century by Francis Galton, 
Karl Pearson, and Walter Weldon for the study of statistics related to biometrics. In 
the editors’ introduction in the first issue (see also [149]), they write: 
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It is intended that Biometrika shall serve as a means not only of collecting or publishing 
under one title biological data of a kind not systematically collected or published elsewhere 
in any other periodical, but also of spreading a knowledge of such statistical theory as may 
be requisite for their scientific treatment. 


The question of interest for Wishart was that of estimating covariance matrices. 
The paper begins with a review of work to date on samples from univariate 
and bivariate populations, and issues with the determination of correlation and 
regression coefficients. After summarizing some of the work and formulas from 
Fisher, Wishart writes: 


The distribution of the correlation coefficient was deduced by direct integration from this 
result. Further, K. Pearson and V. Romanovsky, starting from this fundamental formula, 
were able to deal with the regression coefficients. Pearson, in 1925, gave the mean value 
and standard deviation of the regression coefficient, while Romanovsky and Pearson, in the 
following year, published the actual distribution. 


After talking about the new problems that arise when dealing with three or more 
variates, he continues: 


What is now asserted is that all such problems depend, in the first instance, on the 
determination of a fundamental frequency distribution, which will be a generalisation of 


Eq. (1.2). It will, in fact, be the simultaneous distribution in samples of the n variances 


(squared standard deviations) and the nn) product moment coefficients. It is the purpose 


of the present paper to give this generalised distribution, and to calculate its moments up to 
the fourth order. The case of three variates will first be considered in detail, and thereafter a 
proof for the general n-fold system will be given. 


In his honor the distribution of the sample covariance matrix (arising from a 
sample from a multivariate normal distribution) is called the Wishart distribution. 
More specifically, if we have an n x p matrix X whose rows are independently drawn 
from a p-variate mean O normal distribution, the Wishart distribution is the density 
of the p x p matrices X7X. 

Several items are worth noting here. First, we have an ensemble (a collection) 
of matrices whose entries are drawn from a fixed distribution; in this case there 
are dependencies among the entries. Second, these matrices are used to model 
observable quantities of interest, in this case covariances. Finally, in his article 
he mentions an earlier work of his (published in the Memoirs of the Royal 
Meteorological Society, volume I, pages 29-37, 1928) which experimentally 
confirmed some of the results discussed, thus showing the connections between 
experiment and theory which play such a prominent role later in the story also 
played a key role in the founding. 

It was not until almost thirty years later that random matrix theory, in the hands 
and mind of Wigner, bursts onto the physics scene, and then it will be almost 
another thirty years more before the connections with number theory emerge. Before 
describing these histories in detail, we end the introduction with a very quick tour 
of some of the quantities and objects we’ll meet. 
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1.2. Cast of Characters: Nuclei and L-functions 


The two main objects we study are energy levels of heavy nuclei on the physics 
side, and zeros of the Riemann zeta function (or more generally L-functions) on 
the number theory side, especially Montgomery’s pair correlation conjecture and 
related statistics. We give a full statement of the pair correlation conjecture, and 
results towards its proof, in Sect. 3.2. Briefly, given an ordered sequence of events 
(such as zeros on the critical line, eigenvalues of Hermitian matrices, energy levels 
of heavy nuclei) one can look at how often a difference is observed. The remarkable 
conjecture is that these very different systems exhibit similar behavior. 

We begin with a review of some facts about the these areas, from theories for 
their behavior to how experimental observations were obtained which shed light on 
the structures, and then finish the introduction with some hints at the similarities 
between these two very different systems. Parts of that section, as well as much 
of Sect.2, are expanded with permission from the survey article [50] written by 
two of the authors of this chapter for the inaugural issue of the open access journal 
Symmetry. The goal of that article was similar to this chapter, though there the main 
quantity discussed was Wigner’s semi-circle law and not pair correlation. 

Many, if not all, of the other survey articles in the subject concentrate on the 
mathematics and ignore the experimental physics. When writing the survey [50] the 
authors deliberately sought a balance, with the intention of sharing and elaborating 
on that vantage again in a later work to give a wider audience a more complete 
description of the development of the subjects, as other approaches are already 
available in the literature. We especially recommend to the reader Goldston’s 
excellent survey article Notes on pair correlation of zeros and prime numbers (see 
[64]) for an extended, detailed technical discussion; the purpose of this chapter is 
to complement this and other surveys by highlighting other aspects of the story, 
especially how Montgomery’s work on the pair correlation of zeros of €(s) connects, 
through random matrix theory, a central object of study in number theory to our 
understanding of the physics of heavy nuclei. 


1.2.1. Atomic Theory and Nuclei 


Experiments and experimental data played a crucial role in our evolving understand- 
ing of the atom. For example, Ernest Rutherford’s gold foil experiment (performed 
by Hans Geiger and Ernest Marsden) near the start of the twentieth century 
demonstrated that J. J. Thomson’s plum pudding model of an atom with negatively 
charged electrons embedded in a positively charged region was false, and that the 
atom had a very small positively charged nucleus with the electrons far away. These 
experiments involved shooting alpha particles at thin gold foils. Alpha particles 
are helium atoms without the electrons and are thus positively charged. While 
this positive charge was responsible for disproving the plum pudding model, such 
particles could not deeply probe the positively charged nucleus due to the strong 
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repulsion from like charges. To make further progress into the structure of the 
atom in general, and the nucleus in particular, another object was needed. A great 
candidate was the neutron (discovered by Chadwick in 1932); as it did not have a 
net charge, the electric force would play an immensely smaller role in its interaction 
with the nucleus than it did with the alpha particles. 

The earliest studies of neutron induced reactions showed that the total neutron 
cross section! for the interaction of low-energy (electron-volt, eV) neutrons with 
a nucleus is frequently much greater than the geometrical area presented by the 
target nucleus to the incident neutron [44]. It was also found that the cross section 
varies rapidly as a function of the bombarding energy of the incident neutron. The 
appearance of these well-defined resonances in the neutron cross section is the most 
characteristic feature of low energy nuclear reactions. 

In general, the low energy resonances were found to be closely spaced (spacing 
< 10eV in heavy nuclei), and to be very narrow (widths < 0.1 eV). These facts 
led Niels Bohr to introduce the compound nucleus model [15] that assumes the 
interaction between an incoming neutron and the target nucleus is so strong that 
the neutron rapidly shares its energy with many of the target nucleons. The nuclear 
state that results from the combination of incident neutron and target nucleus may 
therefore last until sufficient energy again resides in one of the nucleons for it to 
escape from the system. This is a statistical process, and a considerable time may 
elapse before it occurs. The long lifetime of the state (t) (on a nuclear timescale) 
explains the narrow width (J”) of the resonance.” Also, since many nucleons are 
involved in the formation of a compound state, the close spacing of the resonances 
is to be expected since there are clearly many ways of exciting many nucleons. 
The qualitative model outlined above has formed the basis of most theoretical 
descriptions of low-energy, resonant nuclear reactions [11]. 

If a resonant state can decay in a number of different ways (or channels), we 
can ascribe a probability per unit time for the decay into a channel, c, which can be 
expressed as a partial width [),. The total width is the sum of the partial widths, 
1.€., Ty = ». Dye. 

The appearance of well-defined resonances occurs in heavy nuclei (mass number 
A = 100, say) for incident neutron energies up to about 100 keV, and in light nuclei 
up to neutron energies of several MeV. As the neutron bombarding energies are 
increased above these energies, the total cross sections are observed to become 
smoother functions of neutron energy [81]. This is due to two effects: firstly, 


' A total neutron cross section is defined as 


Number of events of all types per unit time per nucleus 


Number of incident neutrons per unit time per unit area’ 


and has the dimensions of area (the standard unit is the barn, 10~4 cm?). 


2The width, I”, is related to the lifetime, r, by the uncertainty relation 1 = h/2zt, where h is 
Planck’s constant. The finite width (lack of energy definition) is due to the fact that a resonant 
state can decay by emitting a particle, or radiation, whereas a state of definite energy must be a 
stationary state. 
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the level density (i.e., the number of resonances per unit energy interval) increases 
rapidly as the excitation energy of the compound nucleus is increased, and secondly, 
the widths of the individual resonances tend to increase with increasing excitation 
energy so that, eventually, they overlap. The smoothed-out cross sections provide 
useful information on the average properties of resonances. One of the most 
significant features of these cross sections is the appearance of gross fluctuations 
that have been interpreted in terms of the single-particle nature of the neutron- 
nucleus interaction [92]. These giant resonances form one of the main sources 
of experimental evidence for introducing the successful optical model of nuclear 
reactions. This model represents the interaction between a neutron and a nucleus 
in terms of the neutron moving in a complex potential well [118] in which the 
imaginary part allows for the absorption of the incident neutron. 

Experimental results show that, on increasing the bombarding energy above 
about 5 MeV, a different reaction mechanism may occur. For example, the energy 
spectra of emitted nucleons frequently contain too many high-energy nucleons 
compared with the predictions of the compound nucleus model. The mechanism no 
longer appears to be one in which the incident neutron shares its energy with many 
target nucleons but is one in which the neutron interacts with a single nucleon or, 
at most, a few nucleons. Such a mechanism is termed a direct interaction, which is 
defined as a nuclear reaction in which only a few of the available degrees of freedom 
of the system are involved [6]. 

The optical model, mentioned above, is an important example of a direct 
interaction that takes place even at low bombarding energies. The incident neutron 
is considered to move in the mean nuclear potential of all the nucleons in the target. 
This model also has been used to account for anomalies in the spectra of gamma- 
rays resulting from thermal neutron capture [89, 91]. 

At even higher bombarding energies, greater than 50 MeV, say, the mechanism 
becomes clearer in the sense that direct processes are the most important. The 
reactions then give information on the fundamental nucleon-nucleon interaction; 
these studies and their interpretation are, however, outside the scope of the present 
discussion. 

When a low-energy neutron (energy < 10keV, say) interacts with a nucleus 
the excitation energy of the compound nucleus is greatly increased by the neu- 
tron binding energy that typically ranges from 5 to 10 MeV. In the late 1950s, 
experimental methods were developed for measuring low-energy neutrons with 
resolutions of a few electron-volts. This meant that, for the first time in any physical 
system, it became possible to study the fine structure of resonances at energies 
far above the ground state of the system. The relevant experimental methods are 
discussed in Sect.2. Important information was thereby obtained concerning the 
properties that characterizes the resonances such as their peak cross sections, elastic 
scattering widths, and adjacent spacing. The results were used to test the predictions 
of various nuclear models used to describe the interactions. These models ranged 
from the Fermi Gas Model, a quantized version of classical Statistical Mechanics 
and Thermodynamics [10], to the sophisticated Nuclear Shell Model [11]. In the 
mid-1950s, all Statistical Mechanics Models predicted that the spacing distribution 
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of nearest-neighbor resonances of the same spin and parity in a heavy nucleus 
(mass number A > 100, say) was an exponential distribution. By 1956, the 
experimental evidence on the spacing distribution of s-wave resonances in a number 
of heavy nuclei indicated a lack of very closely-spaced resonances, contradicting 
the predictions of an exponential distribution [71]. By 1960, two research groups 
[48, 124] showed, unequivocally, that the spacing distribution of resonances up to 
an energy of almost 2keV followed the prediction of the random matrix model 
surmised by Wigner in 1956 [147]; in his model the probability of a zero spacing 
is zero! It is a model rooted in statistics, which interestingly is where our story on 
random matrix theory began! 


1.2.2 L-Functions and Their Zeros 


There are many excellent introductions, at a variety of levels, to number theory and 
L-functions. We assume the reader is familiar with the basics of the subject; for more 
details see among others [27, 39, 70, 82, 106, 129]. The discussion below is a quick 
review and is an abridgement (and slight expansion) of [50], which has additional 
details. 

The primes are the building blocks of number theory: every integer can be written 
uniquely as a product of prime powers. Note that the role played by the primes 
mirrors that of atoms in building up molecules. One of the most important questions 
we can ask about primes is also one of the most basic: how many primes are there 
at most x? In other words, how many building blocks are there up to a given point? 

Euclid proved over 2000 years ago that there are infinitely many primes; so, if 
we let (x) denote the number of primes at most x, we know lim,_,55 a(x) = oo. 
Though Euclid’s proof is still used in courses around the world (and gives a growth 
rate on the order of log log x), one can obtain much better counts on (x). 

The prime number theorem states that the number of primes at most x is 
Li(x) + o(Li(x)), where Li(x) = C dt/ log t and for x large, Li(x) is approximately 
x/logx, and f(x) = o(g(x)) means lim,-,.. f(x)/g(x) = 0. While it is possible to 
prove the prime number theorem elementarily [43, 128], the most informative proofs 
use complex numbers and complex analysis, and lead to the fascinating connection 
between number theory and nuclear physics. One of the most fruitful approaches to 
understanding the primes is to understand properties of the Riemann zeta function, 
f(s), which is defined for Re(s) > 1 by 


t(s) == =; (1.1) 


the series converges for Re(s) > 1 by the integral test. By unique factorization, we 
may also write ¢(s) as a product over primes. To see this, use the geometric series 
formula to expand (1 — p~*)~! as )°?2.) p~™ and note that n~* occurs exactly once 
on each side (and clearly every term from expanding the product is of the form n~* 
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for some n). This is called the Euler product of ¢(s), and is one of its most important 
properties: 


C(s) = eae = || (-4) (1.2) 


p prime 


Initially defined only for Re(s) > 1, using complex analysis the Riemann zeta 
function can be meromorphically continued to all of C, having only a simple pole 
with residue 1 at s = 1. It satisfies the functional equation 


E(s) = sis— DF (5) x46) = E(1—s). (1.3) 


One proof is to use the Gamma function, I"(s) = fi °° etl dt. A simple change of 
variables gives 


[oe leary = ce ) n'a 9/2. (1.4) 


Summing over n represents a multiple of ¢(s) as an integral. After some algebra 


we find 
r (5) f(s) = [8 towar+ [vie (<) dx, (1.5) 


: aye : . : 
with w(x) = )(P2, e~” ™*. Using Poisson summation, we see 


=) = 54-234 + stot) (1.6) 
ae (ae = a a + x2@(x), a 


which yields 


mr (5 Je) = = aot [wit heeds (1.7) 


from which the claimed functional equation follows. 

The distribution of the primes is a difficult problem; however, the distribution of 
the positive integers is not and has been completely known for quite some time! The 
hope is that we can understand }*, 1/n' as this involves sums over the integers, and 
somehow pass this knowledge on to the primes through the Euler product. 

Riemann [123] (see [19, 39] for an English translation) observed a fascinating 
connection between the zeros of ¢(s) and the error term in the prime number 
theorem. As this relation is the starting point for our story on the number theory 
side, we describe the details in some length. One of the most natural things to do 
to a complex function is to take contour integrals of its logarithmic derivative; this 
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yields information about zeros and poles, and we will see later in (1.17) that we can 
get even more information if we weigh the integral with a test function. There are 
two expressions for ¢(s); however, for the logarithmic derivative it is clear that we 
should use the Euler product over the sum expansion, as the logarithm of a product 
is the sum of the logarithms. Let 


logp ifn =p’ for some integer r 


A(n) = (1.8) 
0 otherwise. 
We find 
f(s) _ wa logp-p* A) 
fy = Lape le 


(this is proved by using the geometric series formula to write (1 — p~*)~! as 
> p20 1/p’, collecting terms and then using the definition of A(n)). Moving the 
negative sign over and multiplying by x*/s, we find 


1 maOrs 
2ni de (8) s ia =a / Aw (; on 


(c) nsx 


where we are integrating over some line Re(s) = c > 1. The integral on the right 
hand side is 1 if n < x and 0 if n > x (by choosing x non-integral, we do not need 
to worry about x = n), and thus gives )~,, <, A(v). By shifting contours and keeping 
track of the poles and zeros of ¢(s), the residue theorem implies that the left hand 
side is 


x- a (1.11) 


p:f(p)=0 p 


the x term comes from the pole of ¢(s) at s = 1 (remember we count poles with a 
minus sign), while the x°/ term arises from zeros; in both cases we must multiply 
by the residue, which is x°/ p (it can be shown that f(s) has neither a zero nor a pole 
at s = 0). Some care is required with this sum, as }> 1/|p| diverges. The solution 
involves pairing the contribution from p with p; see for example [27]. 

The Riemann zeta function vanishes whenever p is a negative even integer; 
we call these the trivial zeros. These terms contribute )°72_, x 7*/(2k) = —5 
log(1 — x”). This leads to the following beautiful formula, known as the explicit 
formula: 


xP] _9 
x >» — — 5 log(l - )= VA@) (1.12) 
sa, oa nsx 
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If we write n as p’, the contribution from all p” pieces with r > 2 is bounded by 
2x!/? log x for x large, thus we really have a formula for the sum of the primes at 
most x, with the prime p weighted by log p. Through partial summation, knowing 
the weighted sum is equivalent to knowing the unweighted sum. 

We can now see the connection between the zeros of the Riemann zeta function 
and counting primes at most x. The contribution from the trivial zeros is well- 
understood, and is just —} log(1 — x’). The remaining zeros, whose real parts are 
in [0, 1], are called the non-trivial or critical zeros. They are far more important 
and more mysterious. The smaller the real part of these zeros of €(s), the smaller 
the error. Due to the functional equation, however, if (0) = 0 for a critical zero 
p then €(1 — pe) = 0 as well. Thus the ‘smallest’ the real part can be is 1/2. This 
is the celebrated Riemann Hypothesis (RH), which is probably the most important 
mathematical aside ever in a paper. Riemann [19, 39, 123] wrote (translated into 
English; note when he talks about the roots being real, he’s writing the roots as 
1/2 + iy, and thus y € R is the Riemann Hypothesis): 


One now finds indeed approximately this number of real roots within these limits, and it is 
very probable that all roots are real. Certainly one would wish for a stricter proof here; I 
have meanwhile temporarily put aside the search for this after some fleeting futile attempts, 
as it appears unnecessary for the next objective of my investigation. 


Though not mentioned in the paper, Riemann had developed a terrific formula 
for computing the zeros of ¢(s), and had checked (but never reported!) that the first 
few were on the critical line Re(s) = 1/2. His numerical computations were only 
discovered decades later when Siegel was looking through Riemann’s papers. 

RH has a plethora of applications throughout number theory and mathematics; 
counting primes is but one of many. The prime number theorem is in fact equivalent 
to the statement that Re(p) < 1 for any zero of ¢(s), and was first proved 
independently by Hadamard [69] and de la Vallée Poussin [28] in 1896. Each proof 
crucially used results from complex analysis, which is hardly surprising given that 
Riemann had shown z(x) is related to the zeros of the meromorphic function ¢(s). 
It was not until almost 50 years later that Erdés [43] and Selberg [128] obtained 
elementary proofs of the prime number theorem (in other words, proofs that did not 
use complex analysis, which was quite surprising as the prime number theorem was 
known to be equivalent to a statement about zeros of a meromorphic function). See 
[62] for some commentary on the history of elementary proofs. It is clear, however, 
that the distribution of the zeros of the Riemann zeta function will be of primary (in 
both senses of the word!) importance. 

The Riemann zeta function is the first of many similar functions that we can 
study. We assume the reader has seen L-functions before; in addition to the surveys 
mentioned earlier, see also the introductory remarks in [83, 127]. We can examine, 
for the real part of s sufficiently large, 


CO 


L(s,f) = ee (1.13) 


Ss 
n=1 
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of course, while we can create such a function for any sequence {a,(n)} of sufficient 
decay, only certain choices will lead to useful objects whose zeros encode the 
solution to questions of arithmetic interest. For example, if we chose a, arising from 
Dirichlet characters we obtain information about primes in arithmetic progression, 
while taking a¢(p) to count the number of solutions to an elliptic curve y= 
x3 + Ax + B modulo p yields information about the rank of the group of rational 
solutions. 

Our previous analysis, where many of our formulas are due to taking the 
logarithmic derivative and computing a contour integral, suggests that we insist that 
an Euler product hold: 


(oe) 


L(s,f) = = = [| 4G. (1.14) 


s 
n=1 Pp prime 


Further, we want a functional equation relating the values of the completed 
L-function at s and 1 — s, which allows us to take the series expansion that originally 
converges only for real part of s large and obtain a function defined everywhere: 


AGS.) = Lol(s JL(s.f) = ¢Ad—s,f), (1.15) 


where é,, the sign of the functional equation, is of absolute value 1, and 


L,(s,f) 


II 


d 
[[ = arte) 
j=l 


II 


Loo(s,f) = ATL (5 if a) (1.16) 
j=l 


with A # 0 a complex number, Q > 0, ay; > 0 and )"_, ay,(p)” = af(p’). For 
‘nice’ L-functions, it is believed that the Generalized Riemann Hypothesis (GRH) 
holds: All non-trivial zeros real part equal to 1/2. 

We end our introduction to our main number theoretic objects of interest by 
noting that (1.12) is capable of massive generalization, not just to other L-functions 
but we can multiply (1.9) by a nice test function ¢(s) instead of the specific function 
x*/s. The result of this choice is to have a formula that relates sums of @ at zeros 
of our L-function to sums of the Fourier transform of ¢ at the primes. For example 
(see Sect. 4 of [83]) one can show 


CO 
y A = (logp"\ logp 
—logR) = —2 ‘ ; 1.17 
L955 ne ) logR dae 6 (ee) ee orn 


Pp v=l 
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where R is a free scaling parameter chosen for the problem of interest, A = 
2(0) log Q + pe Aj with 


bad 1 = 2mwix 
Aj = / w (es + 7 + =) o(x)dx (1.18) 


and the Fourier transform is defined by 


Oy) = / * b(xje 2" dx. (1.19) 


1.2.3 From the Hilbert-Pélya Connection to Random Matrix Theory 


As stated earlier, the Generalized Riemann Hypothesis asserts that the non-trivial 
zeros of the an L-function are of the form p = 1/2 + iy, with y, real. Thus 
it makes sense to talk about the distribution between adjacent zeros. Around 
1913, Pélya conjectured that the y, are the eigenvalues of a naturally occurring, 
unbounded, self-adjoint operator, and are therefore real.* Later, Hilbert contributed 
to the conjecture, and reportedly introduced the phrase ‘spectrum’ to describe the 
eigenvalues of an equivalent Hermitian operator, apparently by analogy with the 
optical spectra observed in atoms. This remarkable analogy pre-dated Heisenberg’s 
Matrix Mechanics and the Hamiltonian formulation of Quantum Mechanics by more 
than a decade. 

Not surprisingly, the Hilbert-Pélya conjecture was considered so intractable that 
it was not pursued for decades, and random matrix theory remained in a dormant 
state. To quote Diaconis [31]: 


Historically, random matrix theory was started by statisticians [150] studying the corre- 
lations between different features of population (height, weight, income...). This led to 
correlation matrices with (i,j) entry the correlation between the ith and jth features. If the 
data were based on a random sample from a larger population, these correlation matrices 
are random; the study of how the eigenvalues of such samples fluctuate was one of the first 
great accomplishments of random matrix theory. 


Diaconis [32] has given an extensive review of random matrix theory from 
the perspective of a statistician. A strong argument can be made, however, that 
random matrix theory, as we know it today in the physical sciences, began 
in a formal mathematical sense with the Wigner surmise [147] concerning the 
spacing distribution of adjacent resonances (of the same spin and parity) in the 
interactions between low-energy neutrons and nuclei, which we describe in great 
detail in Sect. 2. 


3If v is an eigenvector with eigenvalue A of a Hermitian matrix A (so A = A* with A* the complex 
conjugate transpose of A, then v*(Av) = v*(A*v) = (Av)*v); the first expression is A||v||? 
while the last is A||v||?, with ||v||? = v*v = ¥- |v;|? non-zero. Thus A = 1, and the eigenvalues 
are real. This is one of the most important properties of Hermitian matrices, as it allows us to order 
the eigenvalues. 
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2 The ‘Birth’ of Random Matrix Theory in Nuclear Physics 


Below we discuss some of the history of investigations of the nucleus, concentrating 
on the parts that led to the introduction of random matrix theory to the subject. As 
mentioned earlier, this section is expanded with permission from [50]. Our goal is 
to provide the reader with both sides of the coin, highlighting the interplay between 
theory and experiment, and building the basis for applications to understanding 
zeros of L-functions; we have chosen to spend a good amount of space on 
these experiments and conjectures as these are less-well known to the general 
mathematician than the later parts of our story. 

While other methods have since been developed, random matrix theory was the 
first to make truly accurate, testable predictions. The general idea is that the behavior 
of zeros of L-functions are well-modeled by the behavior of eigenvalues of certain 
matrices. This idea had previously been successfully used to model the distribution 
of energy levels of heavy nuclei (some of the fundamental papers and books on the 
subject, ranging from experiments to theory, include [18, 29, 37, 38, 48, 51, 52, 54, 
57, 63, 71, 72, 78, 98-100, 106, 119, 136, 143-148]). We describe the development 
of random matrix theory in nuclear physics below, and then delve into more of the 
details of the connection between the two subjects. 


2.1 Neutron Physics 


The period from the mid-1930s to the late 1970s was the golden age of neutron 
physics; widespread interest in understanding the physics of the nucleus, coupled 
with the need for accurate data in the design of nuclear reactors, made the 
field of neutron physics of global importance in fundamental physics, technology, 
economics, and politics. In Sect. 1.2.1 we introduced some of the early models for 
nuclei, and discussed some of the original experiments. In this section we describe 
later work where better resolution was possible. Later we will show how a similar 
perspective and chain of progress holds in studies of zeros of the Riemann zeta 
function! Thus the material here, in addition to being of interest in its own right, 
will also provide a valuable vantage for study of arithmetic objects. 

In the mid-1950s, a discovery was made that turned out to have far-reaching 
consequences beyond anything that those working in the field could have imagined. 
For the first time, it was possible to study the microstructure of the continuum in a 
strongly-coupled, many-body system, at very high excitation energies. This unique 
situation came about as the result of the following facts. 


¢ Neutrons, with kinetic energies of a few electron-volts, excite states in compound 
nuclei at energies ranging from about five million electron-volts to almost 
ten million electron-volts—typical neutron binding energies. Schematically, 
see Fig. 1. 
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Fig. 1 An energy-level 
diagram showing the location 
of highly-excited resonances 
in the compound nucleus 
formed by the interaction of a 
neutron, n, with a nucleus of 
mass number A. Nature 
provides us with a narrow 
energy region in which the 
resonances are clearly 
separated, and are observable 


Region of broad, 
overlapping 
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| SCattering 


5 fe 
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scattering 
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Ground state of 
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¢ Low-energy resonant states in heavy nuclei (mass numbers greater than about 
100) have lifetimes in the range 10~'4-107~!° s, and therefore they have widths 
of about 1 eV. The compound nucleus loses all memory of the way in which 
it is formed. It takes a relatively long time for sufficient energy to reside in a 
neutron before being emitted. This is a highly complex, statistical process. In 
heavy nuclei, the average spacing of adjacent resonances is typically in the range 
from a few eV to several hundred eV. 

¢ Just above the neutron binding energy, the angular momentum barrier restricts 
the possible range of values of total spin of a resonance, J (J =1+i+1, where I 
is the spin of the target nucleus, i is the neutron spin, and | is the relative orbital 
angular momentum). This is an important technical point. 

¢ The neutron time-of-flight method provides excellent energy resolution at 
energies up to several keV. (See Firk [47] for a review of time-of-flight 
spectrometers.) 


The speed v, of a neutron can be determined by measuring the time f,, that it takes 
to travel a measured distance £ in free space. Using the standard result of special 
relativity, the kinetic energy of the neutron can be deduced using the equation 


E, = Eo((— v2/c*)-¥/? - 1] 
Eld=C ee)? = i, (2.1) 


II 
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where Ey ~ 939.553 MeV is the rest energy of the neutron and c = 2.997925 - 108 
m/s is the speed of light. 

If the units of energy are MeV, and those of length and time are meters and 
nanoseconds, then 


E, = 939.553[(1 — 11.12649607/12)~'/? — 1] Mev. (22) 


It is frequently useful to rearrange this equation to give the ratio 1,/ for a given 
energy, E,,: 


tn/€ = 3.3356404/,/1 — (939.553/(E, + 939.553))2. (2.3) 


Typical values for this ratio are 72.355 ns/m for E,, = 1 MeV and 23.044 ns/m for 
E, = 10 MeV. 
At energies below | MeV, the non-relativistic approximation to (2.3) is adequate: 


(t,/0)ne = WEo/2Enc2 = 72.298//E, ws/m. (2.4) 


In the eV-region, it is usual to use units of jts/m: a 1eV neutron travels 1m in 
72.3 s. At non-relativistic energies, the energy resolution AE at an energy E is 
simply: 


AE & 2EAt/te, (2.5) 


where At is the fotal timing uncertainty, and tg is the flight time for a neutron of 
energy E. 

In 1958, the two highest-resolution neutron spectrometers in the world had total 
timing uncertainties At ~ 200ns. For a flight-path length of 50m the resolution 
was AE = 3eV at | keV. 

In 38U + n, the excitation energy is about 5 MeV; the effective resolution for a 
1 keV-neutron was therefore 


AE] Eecttective ~ 6: 10-7 (2.6) 


(at 1 eV, the effective resolution was about 107!'). 
Two basic broadening effects limit the sensitivity of the method. 


1. Doppler broadening of the resonance profile due to the thermal motion of the 
target nuclei; it is characterized by the quantity 6 ~ 0.3,/E/A (eV), where A is 
the mass number of the target. If E = 1 keV and A = 200, 6 = 0.7eV, a value 
that may be ten times greater than the natural width of the resonance. 

2. Resolution broadening of the observed profile due to the finite resolving power 
of the spectrometer. For a review of the experimental methods used to measure 
neutron total cross sections see Firk and Melkonian [49]. Lynn [95] has given a 
detailed account of the theory of neutron resonance reactions. 
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In the early 1950s, the field of low-energy neutron resonance spectroscopy was 
dominated by research groups working at nuclear reactors. They were located at 
National Laboratories in the United States, the United Kingdom, Canada, and the 
former USSR. The energy spectrum of fission neutrons produced in a reactor is 
moderated in a hydrogenous material to generate an enhanced flux of low-energy 
neutrons. To carry out neutron time-of-flight spectroscopy, the continuous flux from 
the reactor is “chopped” using a massive steel rotor with fine slits through it. At the 
maximum attainable speed of rotation (about 20,000rpm), and with slits a few 
thousandths-of-an-inch in width, it is possible to produce pulses each with a duration 
approximately 1 \1s. The chopped beams have rather low fluxes, and therefore the 
flight paths are limited in length to less than 50m. The resolution at | keV is then 
AE & 20 eV, clearly not adequate for the study of resonance spacings about 10 eV. 

In 1952, there were only four accelerator-based, low-energy neutron spectrom- 
eters operating in the world. They were at Columbia University in New York 
City, Brookhaven National Laboratory, the Atomic Energy Research Establish- 
ment, Harwell, England, and at Yale University. The performances of these early 
accelerator-based spectrometers were comparable with those achieved at the reactor- 
based facilities. It was clear that the basic limitations of the neutron-chopper 
spectrometers had been reached, and therefore future developments in the field 
would require improvements in accelerator-based systems. 

In 1956, a new high-powered injector for the electron gun of the Harwell electron 
linear accelerator was installed to provide electron pulses with very short durations 
(typically less than 200 ns) [51]. The pulsed neutron flux (generated by the (y, n) 
reaction) was sufficient to permit the use of a 56m flight path; an energy resolution 
of 3 eV at 1 keV was achieved. 

At the same time, Professors Havens and Rainwater (pioneers in the field of 
neutron time-of-flight spectroscopy) and their colleagues at Columbia University 
were building a new 385 MeV proton synchrocyclotron a few miles north of the 
campus (at the Nevis Laboratory). The accelerator was designed to carry out 
experiments in meson physics and low-energy neutron physics (neutrons generated 
by the (p, n) reaction). By 1958, they had produced a pulsed proton beam with 
duration of 25ns, and had built a 37m flight path [30, 124]. The hydrogenous 
neutron moderator generated an effective pulse width of about 200ns for 1 keV- 
neutrons. In 1960, the length of the flight path was increased to 200m, thereby 
setting a new standard in neutron time-of-flight spectroscopy [56]. 


2.2 The Wigner Surmise 


At a conference on Neutron Physics by Time-of-Flight, held in Gatlinburg, Ten- 
nessee on November Ist and 2nd, 1956, Professor Eugene Wigner (Nobel Laureate 
in Physics, 1963) presented his surmise regarding the theoretical form of the spacing 
distribution of adjacent neutron resonances (of the same spin and parity) in heavy 
nuclei. At the time, the prevailing wisdom was that the spacing distribution had a 
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Poisson form (see, however, [68]). The limited experimental data then available 
was not sufficiently precise to fix the form of the distribution (see [78]). The 
following quotation, taken from Wigner’s presentation at the conference, introduces 
the concept of random matrices in Physics, for the first time: 


Perhaps I am now too courageous when I try to guess the distribution of the distances 
between successive levels. I should re-emphasize that levels that have different J-values 
(total spin) are not connected with each other. They are entirely independent. So far, 
experimental data are available only on even-even elements. Theoretically, the situation is 
quite simple if one attacks the problem in a simple-minded fashion. The question is simply 
‘what are the distances of the characteristic values of a symmetric matrix with random 
coefficients?’ 

We know that the chance that two such energy levels coincide is infinitely unlikely. 


a1 412 


We consider a two-dimensional matrix, ( ) , in which case the distance between two 


a21 422 


levels is ,/ (a1) — az2)? + 4a‘. This distance can be zero only if aj1 = az2 and aj. = 0. 
The difference between the two energy levels is the distance of a point from the origin, the 
two coordinates of which are (a1; — a22) and aj2. The probability that this distance is S 
is, for small values of S, always proportional to S itself because the volume element of the 
plane in polar coordinates contains the radius as a factor.... 

The probability of finding the next level at a distance S now becomes proportional to SdS. 
Hence the simplest assumption will give the probability 


a) _F 10 
ri exp ( PPS?) Ss (2.7) 


for a spacing between S and S + dS. 
If we put x = pS = S/(S), where (S) is the mean spacing, then the probability distribution 
takes the standard form 


D(x)dx = . xX exp (—2x"/4) dx, (2.8) 


where the coefficients are obtained by normalizing both the area and the mean to unity. 


The form of the Wigner surmise had been previously discussed by Wigner [143], 
and by Landau and Smorodinsky [88], but not in the spirit of random matrix theory. 

The Wigner form, in which the probability of zero spacing is zero, is strikingly 
different from the Poisson form 


p(x)dx = exp(—x)dx (2.9) 


in which the probability is a maximum for zero spacing. The form of the Wigner 
surmise had been previously discussed by Wigner himself [143], and by Landau 
and Smorodinsky [88], but not in the spirit of random matrix theory. 

It is interesting to note that the Wigner distribution is a special case of a general 
statistical distribution, named after Professor E. H. Waloddi Weibull (1887-1979), 
a Swedish engineer and statistician [142]. For many years, the distribution has been 
in widespread use in statistical analyses in industries such as aerospace, automotive, 
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electric power, nuclear power, communications, and life insurance.* The distribution 
gives the lifetimes of objects and is therefore invaluable in studies of the failure 
rates of objects under stress (including people!). The Weibull probability density 
function is 


k k-l 
Weil: kA) = 5 (5) exp (—(x/A)*) (2.10) 
where x > 0, k > 0 is the shape parameter, and A > 0 is the scale parameter. We 
see that Wei(x; 2, 2/./7) = p(x), the Wigner distribution. Other important Weibull 
distributions are given in the following list. 


¢ Wei(x; 1, 1) = exp(—x) the Poisson distribution; 
© Wei(x; 2,4) = Ray(A), the Rayleigh distribution; 
° Wei(x; 3, A) is approximately a normal distribution.° 


For Wei(x; k, A), the mean is AI (1 + (1/k)), the median is A log(2)!/*, and the 
mode is A(k — 1)!/*/k!/, if k > 1. As k — 00, the Weibull distribution has a sharp 
peak at A. Historically, Frechet introduced this distribution in 1927, and Nuclear 
Physicists often refer to the Weibull distribution as the Brody distribution [18]. 

At the time of the Gatlinburg conference, no more than 20 s-wave neutron 
resonances had been clearly resolved in a single compound nucleus and therefore 
it was not possible to make a definitive test of the Wigner surmise. Immediately 
following the conference, Harvey and Hughes [71], and their collaborators, working 
at the fast-neutron-chopper-groups at the high flux reactor at the Brookhaven 
National Laboratory, and at the Oak Ridge National laboratory, gathered their own 
limited data, and all the data from neutron spectroscopy groups around the world, 
to obtain the first global spacing distribution of s-wave neutron resonances. Their 
combined results, published in 1958, showed a distinct lack of very closely spaced 
resonances, in agreement with the Wigner surmise. 

By late 1959, the experimental situation had improved, greatly. At Columbia 
University, two students of Professors Havens and Rainwater completed their Ph.D. 
theses; one, Rosen [124], studied the first 55 resonances in 78U + n up to 1 keV, 
and the other, Desjardins [30], studied resonances in two silver isotopes (of different 
spin) in the same energy region. These were the first results from the new high- 
resolution neutron facility at the Nevis cyclotron. 

At Harwell, Firk et al. [48] completed their study of the first 100 resonances in 
38) + n at energies up to 1.8 keV; their measurement of the total neutron cross 


“Tn fact, one of the authors has used Weibull distributions to model run production in major league 
baseball, giving a theoretical justification for Bill James’ Pythagorean Won-Loss formula [103]. 
Obviously this Weibull cannot be a normal distribution, as they have very different decay rates 
for large x, and this Weibull is a one-sided distribution! What we mean is that for 0 < x < 2 this 
Weibull is well approximated by a normal distribution which shares its mean and variance, which 
are (respectively) (4/3) ~ 0.893 and '(5/3) — (4/3)? © 0.105. 
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Fig. 2. High resolution studies of the total neutron cross section of 7°°U, in the energy range 
400-1800 eV. The vertical scale (in units of “barns”) is a measure of the effective area of the target 
nucleus 


section for the interaction 7*°U + n in the energy range 400-1800 eV is shown in 
shown in Fig. 2. 

When this experiment began in 1956, no resonances had been resolved at 
energies above 500eV. The distribution of adjacent spacings of the first 100 
resonances in the single compound nucleus, 77*U + n, ruled out an exponential 
distribution and provided the best evidence (then available) in support of Wigner’s 
proposed distribution. 

Over the last half-century, numerous studies have not changed the basic findings. 
At the present time, almost 1000 s-wave neutron resonances in the compound 
nucleus *°°U have been observed in the energy range up to 20keV. The latest 
results, with their greatly improved statistics, are shown in Fig. 3 [29]. 
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Fig. 3. A Wigner distribution fitted to the spacing distribution of 932 s-wave resonances in the 
interaction °U + n at energies up to 20 keV 


2.3 Some Nuclear Models 


It is interesting to note that, during the 1950s and 1960s, the study of the spacing 
distribution of neutron-induced resonances was far from the main stream of research 
in nuclear physics; almost all research was concerned with fundamental questions 
associated with nuclear structure and not with quantum statistical mechanics. The 
newly-discovered Shell Model [90, 97] of nuclei, and developments such as the 
Collective Model [14, 121] were popular, and quite rightly so, when the successes 
of these models in accounting for the observed energies, spins and parities, and 
magnetic moments of nuclear states, particularly in light nuclei (mass numbers < 
20, say) were considered. 

These models were not able to account for the spacing distributions in heavy 
nuclei (mass numbers A > 150); the complex nature of so many strongly interacting 
nucleons prevented any detailed analysis. However, the treatment of such complex 
problems had been considered in the mid-1930s, before the advent of the Shell- 
Model. The Fermi Gas Model and other approaches based upon quantum versions 
of classical statistical mechanics and thermodynamics, were introduced, particularly 
by Bethe [10]. The Fermi Gas Model treats the nucleons as non-interacting spin-5 
particles in a confined volume of nuclear size. This, of course, seems at variance 
with the known strong interaction between pairs of nucleons. However, the argument 
is made that the nuclear gas is completely degenerate and therefore, because of the 
Pauli exclusion principle, the nucleons can be considered free! The model was the 
first to predict the energy-dependence of the density of states in the nuclear system. 
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The number of states that are available to a freely moving particle in a volume V 
(the nuclear volume) that has a linear momentum in the range p to p + dp is 


dn = (4nV/h>)p*dp. (2.11) 
This leads to 
n = (V/3n7h?)p> (2.12) 


where the result has been doubled because of the twofold spin degeneracy of the 
nucleons. The “Fermi energy” Er corresponds to the maximum momentum: 


Er = Deass Dusan: (2.13) 


The level density p(£*) at an excitation energy E* predicted by the model is 
p(E*) = p(0)exp (2VaE*) (2.14) 
where a is given by the equation 
E* = a(kTY (2.15) 


in which k is Boltzmann’s constant and T is the absolute temperature. The above 
expression for the level density is for states of all spins and parities. 

In practical cases, E* is about 6 MeV for low- energy neutron interactions; this 
value leads to the following ratio for the mean level spacing at E* = 6 MeV and at 
E* = 0 (the ground state): 


(D(6 MeV)) /(D(0)) © 4- 1078. (2.16) 


For (D(0)) = 100 keV (a practical value), the mean level spacing at E* = 6MeV 
is ~ 4-107 eV, which is more than three orders-of-magnitude smaller than typical 
values observed in heavy nuclei. 

Many refinements of the model were introduced over the years; the models take 
into account spin, parity, and nucleon pairing effects. A frequently used refined 
form is 


p(E*,J) = p(E*,0)(2J + lexp(—VUV + 1))/207), (2.17) 
where o is called the “spin-cut-off parameter”; the value of o? is typically about 10. 


The predicted spacing distributions for two values of o, and their comparison with 
a Wigner and an exponential distribution is shown in Fig. 4. 
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Fig. 4 The spacing 1.0 
distribution of adjacent levels 
of the same spin and parity 
follows a Wigner distribution. 
For a completely random 
distribution of levels (in both 
spin and parity) the 
distribution function is 
exponential. The distributions 
for random superpositions of 
several sequences (each of 
which is of a Wigner form 
with a characteristic spin and 
parity) are, for level densities 
given by (2.17) ando = 1 
and 3, found to approach the 
exponential distribution 
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2.4 The Optical Model 


In 1936, Ostrofsky et. al. [118] introduced a model of nuclear reactions that 
employed a complex nuclear potential to account for absorption of the incoming 
nucleon. Later, Feshbach et al. [45] introduced an important development of the 
model that helped further our understanding of the average properties of parameters 
used to describe nuclear reactions at low energies. 

The following discussion provides insight into the physical content of their 
model. Consider the plane-wave solutions of the Schrddinger equation: 


2 


ae + (2m/W)[E+Vo+iW]p = 0, $ = exp(tikx), (2.18) 


where the + sign indicates outgoing waves and the — sign indicates incoming 
waves. The wave number, k is complex: 


k = V(Qm/h2)[(E + VO) + iW], (2.19) 
which can be written 
k = ke+ Km. (2.20) 


For W < (E+ Vo) (a reasonable assumption) we have 


ke = 1/A ® J(Qm/h2)(E + Vo) 
Kim = [W/(E + Vo)](k/2). (2.21) 


From Quantum Systems to L-Functions: Pair Correlation Statistics and Beyond 145 


Taking typical practical values E = 10 MeV, Vo = 40 MeV and W = 10MeV, the 
wave numbers are kg © 1.5fm — 1 and Kjy © kp/10 & 0.15fm7!. 
We see that the outgoing solution of the wave equation is 


ob = exp (ikgx) exp (—Kimx) , (2.22) 


which represents an exponentially attenuated wave. The wave number Kym is 
effectively an attenuation coefficient. The “decay length” associated with the 
probability function ||? is the “mean free path”: 


A = 1/2Kiq = (E+ Vo)/Wkr. (2.23) 


Using the above values for the energies, we obtain A ~ 3.2fm. This value is 
of nuclear dimension, and supports the underlying hypothesis of the Compound 
Nucleus Model. 

If the mean spacing of energy levels of a particle of mass m inside the compound 
nucleus is (D), and its wave number is K, then the particle covers a distance 


d = (h/(D))((hK/2mm) = (h’K)/(2xm(D)) (2.24) 


inside the nucleus at an average speed (v) ~ hK/27m before it is emitted (or before 
another indistinguishable particle is emitted). At an excitation energy of 10 MeV, 
a mean level spacing (D) ~ 40eV, and a mean lifetime h/(D) ~ 107!°s are 
predicted. These are reasonable values, considering the crudeness of the model. 

The level density and level widths increase as the neutron bombarding energy 
increases; an energy region is therefore reached in which the levels completely 
overlap. Cross section measurements then provide information on the average 
properties of the levels and, in particular, on the neutron strength function [92] 
defined as 


S = (yan)?/(D) (2.25) 


in which = (y,,)? is the average reduced neutron width and (D) is the average 
spacing. For s-wave neutrons, Vi. = 2kaIy,, where k is the neutron wave number, 
ais the nuclear radius, and I, is the neutron width of the level 1. 

The average absorption cross section (aps) may be obtained by averaging over 
the collision function U [92]. The following expressions are then obtained: 


II 


1—|(U)? = 2x (Pin)/(D)) 
(Gass) = (r/K)g (1-(U)) (2.26) 


where g is a Statistical “spin weighting factor”. 
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The term 1 — |(U)|? is directly related to the cross section for the formation of a 
compound nucleus [45] which is, in turn, proportional to the strength function. The 
importance of studying the spacing distribution of resonances, of a given spin and 
parity, originated in recognizing that the value of (D), the average spacing, appears 
as the denominator in the fundamental strength function. 


2.5 Further Developments 


The first numerical investigation of the distribution of successive eigenvalues 
associated with random matrices was carried out by Porter and Rozenzweig in 
the late 1950s [120]. They diagonalized a large number of matrices where the 
elements are generated randomly but constrained by a probability distribution. The 
analytical theory developed in parallel with their work: Mehta [98], Mehta and 
Gaudin [100], and Gaudin [57]. At the time it was clear that the spacing distribution 
was not influenced significantly by the chosen form of the probability distribution. 
Remarkably, the x n distributions had forms given almost exactly by the original 
Wigner 2 x 2 distribution. 

The linear dependence of p(x) on the normalized spacing x (for small x) is a direct 
consequence of the symmetries imposed on the Hamiltonian matrix, H(hj). Dyson 
[37] discussed the general mathematical properties associated with random matrices 
and made fundamental contributions to the theory by showing that different results 
are obtained when different symmetries are assumed for H. He introduced three 
basic distributions; in Physics, only two are important, they are: 


¢ the Gaussian Othogonal Ensemble (GOE) for systems in which rotational 
symmetry and time-reversal invariance holds (the Wigner distribution): p(x) = 
(1/2) x exp (—(2/4)x”); 

¢ the Gaussian Unitary Ensemble (GUE) for systems in which time-reversal 
invariance does not hold (French et. al. [54]): p(x) = (32/27)x? exp(—(/4)x’). 


The mathematical details associated with these distributions are given in [98]. 

The impact of these developments was not immediate in nuclear physics. 
At the time, the main research endeavors were concerned with the structure of 
nuclei—experiments and theories connected with Shell-, Collective-, and Unified 
models, and with the nucleon-nucleon interaction. The study of quantum statistical 
mechanics was far removed from the mainstream. Almost two decades went by 
before random matrix theory was introduced in other fields of physics (see, for 
example, Bohigas et al. [16] and Alhassid [1]). 
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2.6 Lessons from Nuclear Physics 


We have discussed at great length the connections between nuclear physics and 
number theory, with random matrix theory describing the behavior in these two very 
different fields. Before we analyze in great detail the success it has had in modeling 
the zeros of L-functions, it’s worth taking a few moments to create a dictionary 
comparing these two subjects. 

In nuclear physics the main object of interest is the nucleus. It is a many-bodied 
system governed by complicated forces. We are interested in studying the internal 
energy levels. To do so, we shoot neutrons (which have no net charge) at the nucleus, 
and observe what happens. Ideally we would be able to send neutrons of any energy 
level; unfortunately in practice we can only handle neutrons whose energies are in 
a certain band. The more energies at our disposal, the more refined an analysis is 
possible. Finally, there is a remarkable universality from heavy nucleus to heavy 
nucleus, where the distribution of spacings between adjacent energy levels depends 
weakly on the quantum numbers. 

Interestingly, there are analogues of all these quantities on the number theory 
side. The nucleus is replaced by an L-function, which is built up as an Euler product 
of many factors of arithmetic interest. We are interested in the zeros of this function. 
We can glean information about them by using the explicit formula, (1.17). We first 
choose an even Schwartz test function ¢ whose Fourier transform ¢ has compact 
support. The explicit formula relates sums of ¢ at the zeros of the L-function 
to weighted sums of @ at the primes. Thus the more functions @ where we can 
successfully execute the sums over the primes, the more information we can deduce 
about the zeros. Unfortunately, in practice we can only evaluate the prime sums for @ 
with small support (if we could do arbitrary d, we could take a sequence converging 
to the constant function 1, whose inverse Fourier transform would be a delta spike at 
the origin and thus tell us what is happening there). Similar to the weak dependence 
on the quantum numbers, the answers for many number theory statistics depend 
weakly on the Satake parameters (whose moments are the Fourier coefficients in 
the series expansion of the L-function). In particular, the spacing between adjacent 
zeros is independent of the distribution of these parameters, though other statistics 
(such as the distribution of the first zero or first few zeros above the central point) 
fall into several classes depending on their distribution. 

We collect these correspondences in the table below. While the structures studied 
in the two fields are very different, we can unify the presentations. In both settings 
we study the spacings between objects. While there are exact rules that govern 
their behavior, these are complicated. We gain information through interactions of 
test objects with our system; as we can only analyze these interactions in certain 
windows, we gain only partial information on the items of interest. 

We end by extracting some lessons from nuclear physics for number theory. 
The first is the importance of using the proper test function, or related to that the 
proper statistic. In the gold-foil experiments (1908 to 1913) positively charged alpha 
particles, which are helium nuclei, were used. Because they have a net positive 
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Item Nuclear physics Number theory 

Object Nucleus L-function 

Events Energy levels Zeros 

Probe Neutron (no net charge) | Test function ¢ (Schwartz) 
Restriction Neutron’s energy Supp) 

Individuality | Quantum numbers Satake parameters 


charge, they are repelled by the nucleus they are probing. With the discovery of the 
neutron in 1932, physicists had a significantly better tool for studying the nucleus. 
As the machinery improved, more and more neutron energy levels were available, 
which led to sharper resolutions of the internal structure. We see variants of these on 
the number theory side, from restrictions on the test function to the consequences of 
increasing support. For example, when Wigner made his bold conjectures the data 
was not sufficiently detailed to rule out Poissonian behavior; that was not done until 
later when better experiments were carried out. Similar situations arise in number 
theory, where some statistics are consistent with multiple models and only by 
increasing the support are we able to determine the true underlying behavior. Finally, 
while there is a remarkable universality in behavior of the zeros, as for statistics 
such as adjacent spacings or n-level correlations the exact form of the L-function 
coefficients do not matter, these distributions do affect the rate of convergence to 
the random matrix theory predictions, as well as govern other statistics. 


3 From Class Numbers to Pair Correlation 
and Random Matrix Theory 


The discovery that the pair correlation of the zeros of the Riemann zeta function 
(and other statistics of its zeros, and the zeros of other L-functions) are related 
to eigenvalues of random matrix ensembles has its beginnings with one of the 
most challenging problems in analytic number theory: the class number problem. 
Hugh Montgomery’s investigation into the vertical distribution of the nontrivial 
zeros of €(s) arose during his work with Weinberger [108] on the class number 
problem. We give a short introduction to this problem to motivate Montgomery’s 
subsequent work on the differences between zeros of ¢(s). We assume the reader 
is familiar with the basics of algebraic number theory and L-functions; an excellent 
introduction is Davenport’s classic Multiplicative Number Theory [27]. For those 
wishing a more detailed and technical discussion of the class number problem 
and its history, see [61, 62]. We then continue with a discussion of Montgomery’s 
work on pair correlation, followed by the work of Odlyzko and others on spacings 
between adjacent zeros. After introducing the number theory motivation and results, 
we reveal the connection to random matrix theory, and conclude with a discussion 
of the higher level correlations, other related statistics, and open problems. 
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As there are too many areas of current research to describe them all in detail in a 
short article, we have chosen to concentrate on two major areas: the main terms for 
the n-level correlations, and the lower order terms; thus we do not describe many 
other important areas of research, such as the determination of moments or value 
distribution. The main terms are believed to be described by random matrix theory; 
however, the lower order terms depend on subtle arithmetic of the L-functions, and 
there we can see different behavior. The situation is very similar to that of the Central 
Limit Theorem, and we will describe these connections and viewpoints in greater 
detail below. 


3.1 The Class Number Problem 


Let K = Q(/—9) be the imaginary quadratic field associated to the negative 
fundamental discriminant —g. Here we have that —q is congruent to 1 (mod4) and 
square-free or —g = 4m, where m is congruent to 2 or 3 (mod4) and square-free. 
The class number of K, denoted h(—g), is the size of the group of ideal classes of K. 
When h(—q) = 1, the ring of integers of K, denoted Gx, has unique factorization. 
Such an occurrence (the class number one problem, discussed below) is rare, and 
the class number h(—q) may be thought of as a measure on the failure of unique 
factorization in Ox. 

One of the most difficult problems in analytic number theory is to estimate 
the size of h(—q) effectively. Gauss [58] showed that h(—q) is finite and further 
conjectured that h tends to infinity as —q runs over the negative fundamental 
discriminants. This conjecture was proved by Heilbronn [75] in 1934. Thus, while 
it is settled that there are only finitely many imaginary quadratic fields with a 
given class number /(—q), an obvious question remains: can we list all imaginary 
quadratic fields K with a given class number h(—q)? This is the class number 
problem. 

One may easily deduce an upper bound on h(—d) via Dirichlet’s class number 
formula. For )i(s) > 1, let L(s, xy_,) denote the Dirichlet L-function 


L(s, Yq) = _ (3.1) 
n=1 


where y—,(n) is the Kronecker symbol associated to the fundamental discrimi- 
nant —q. In order to prove the equidistribution of primes in arithmetic progression, 
Dirichlet derived the class number formula, 


h(—q) = "Fn, X-4)> (3.2) 
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where w denotes the number of roots of unity of K = Q(,/—q): 


2 if q>4 
w= 44 if g=4 (3.3) 
6 if g=3. 


Dirichlet needed to show L(1, x—q) # 0, which is immediate from the class number 
formula as h(—q) > 1. This connection between class numbers and zeros of 
L-functions is almost 200 years old, and illustrates how knowledge of zeros of 
L-functions yields information on a variety of important problems. 

Instead of using the class number formula to prove non-vanishing of L-functions, 
we can use results on the size of L-functions to obtain bounds on the class number. 
Combining (3.2) with that fact that L(1, y-,) « logg, it follows that h(—q) < 
./q\og q. On the other-hand, Siegel [132] proved that for every ¢ > 0 we have 
LC, x-q) > c(e)q-°, where c(e) is a constant depending on « that is not numerically 
computable for small ¢. Upon inserting this lower bound in (3.2), it follows that 
h(—q) > c(e)q'/?-®; however this does not help us solve the class number problem 
because the implied constant is ineffective. Computing an effective lower bound 
on h(—q) is very difficult task. 

The class number one problem was eventually solved independently by Heegner 
[74], Stark [133] and Baker [8]. For h(—d) = 2, the class number problem was 
solved independently by Stark [134], Baker [9] and Montgomery and Weinberger 
[108]. In 1976, Goldfeld [59, 60] showed that if there exists an elliptic curve E 
whose Hasse-Weil L-function has a zero at the central point s = 1| of order at least 
three, then for any ¢ > 0, we have h(—q) > cz.z log(| — q|)'~®, where the constant 
Ce.g iS effectively computable. In other words, Goldfeld proved that if there exists 
an elliptic curve whose Hasse-Weil L-function has a triple zero at s = 1, then the 
class number problem is reduced to a finite amount of computations. In 1983, Gross 
and Zagier [66] showed the existence of such an elliptic curve. Combining this deep 
work of Gross-Zagier with a simplified version of Goldfield’s argument to reduce 
the amount of necessary computations, Oesterlé [115] produced a complete list of 
imaginary quadratic fields with h(—q) = 3. To date, the class number problem is 
resolved for all 1 < h(—q) < 100. (In addition to the previous references, see Arnon 
[4], Arnon et al. [5], Wanger [140] and Watkins [141].) 


®In other words, while the above is enough to prove that the class number tends to infinity, we 
cannot use that argument to produce an explicit constant Q, for each n so that we could assert 
that the class number is at least n if g => Q,. One of the best illustrations of the importance of 
effective constants is the following joke: There is a constant 7p such that if all the non-trivial zeros 
of ¢(s) in the critical strip up to height Tp are on the critical line, then they all are and the Riemann 
Hypothesis is true; in other words, it suffices to check up to a finite height! To see this, if the 
Riemann Hypothesis is true we may take 7p to be 0, while if it is false we take Tp to be | more than 
the height of the first exemption. We have therefore shown a constant exists, but such information 
is completely useless! 
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Combining their work with results of Stark [134] and Lehmer et al. [93], 
Montgomery and Weinberger gave a complete proof for the class number two 
problem. Their proof is based on the curious Deuring-Heilbronn phenomenon, 
which implies that if h(—d) < d'/4~* then the low-lying nontrivial zeros of many 
quadratic Dirichlet L-functions are on the critical line, at least up to some height 
depending on d, 5, and the L-functions. For an overview of the Deuring-Heilbronn 
phenomenon, see the survey article by Stopple [135]. Montgomery and Weinberger 
also establish that if the class number is a bit smaller, then one can show that these 
nontrivial zeros on the critical line are very evenly spaced. Moreover, more precise 
information about the vertical distribution of these zeros would imply an effective 
lower bound on h(—d). Montgomery and Weinberger write: 


Let po = 1/2 + iy and p’ = 1/2 + iy’ be consecutive zeros on the critical line of an 
L-function L(s, 7), where y is a primitive character (modk). Put 


1 
A(K) = min —ly —y’|logK, (3.4) 
20 


where the minimum is over all k < K, all y (modk), and all p = 1/2 + iy of L(s, y) with 
|y| < 1. In this range the average of |y — y’| is 277/ log k, so trivially limsupA(K) < 1. 
Presumably A(K) tends to 0 as K increases; if this could be shown effectively then the 
effective lower bound h > d'/4—* would follow. In fact the weak inequality A(K) < 1/4—6 
for K > Ko implies that h > d“/28—€ for d > C(Ko, €); the function C(Ko, €) can be made 


explicit. Even A(K) < ; — 6 has striking consequences. 


3.2 Montgomery’s Pair Correlation of the Zeros of &(s) 


We have seen that the class number problem is related to another very difficult 
question in analytic number theory: What is the vertical distribution of the zeros 
of the Riemann zeta function (and general L-functions) on the critical line? 

Given an increasing sequence {a,}°2, and a box B C R"|, the n-level 
correlation is defined by 


lim # {(qj, Aj y ee 5 Aj, — oti, ) € B Ji # in} . (3.5) 
Noo N 


The pair correlation is the case n = 2, and through combinatorics knowing all the 
correlations yields the spacing between adjacent events (see for example [99]). In 
1973, Montgomery [107] was able to partially determine the behavior for the pair 
correlation of zeros of the Riemann zeta function, €(s), which led to new results on 
the number of simple zeros of €(s) and the existence of gaps between zeros of f(s) 
that are closer together than the average. One of the most striking contributions in 
Montgomery’s paper, however, is his now famous pair correlation conjecture. We 
first state his conjecture and then discuss related work on spacings between adjacent 
zeros in the next subsection; after these have been described in detail we then revisit 
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these problems and describe the connections with random matrix theory in Sect. 3.5. 
See [25] for more on connections between spacings of zeros of ¢(s) and the class 
number. 


Conjecture 1 (Montgomery’s Pair Correlation Conjecture). Assume the Riemann 
hypothesis, and let y, y’ denote the imaginary parts of nontrivial zeros of €(s). For 
fixedO<a<b<o, 


fim YO SY ST, 2nallogT) | < y —y' < 2xb(logT)'} 
im 


T-00 + log T 


2 sin zu \7 
= i 1- du. (3.6) 
- TU 


Thus Montgomery’s pair correlation conjecture is the statement that the pair 
correlation of the zeros of €(s) is 
sin tu \* 
1- F (3.7) 
wu 


Notice that the factor 1 — (sinzu/mu)* suggests a ‘repulsion’ between the 
zeros of €(s). The notion that the zeros cannot be too close to one another was 
also revealed in the aforementioned work of Montgomery and Weinberger as a 
consequence of the Deuring-Heilbronn phenomenon. 

To arrive at his conjecture, Montgomery introduced the function 


F@,T) = DP x? wy-7), (3.8) 


O<y,y/<T 


where w(u) is a weight function given by w(u) = 4/(4 + wu’). Let F(a) denote 
F(x, T) with x set as x = T®; then 


-1 
F(a) = F(a,T) = (= ver) Yo TOM wy -y’), (3.9) 


O<y,y/ST 


where a and T > 2 are real. F(a) is a real, even function. Let r(u) € L', and define 
its Fourier transform by 


r(@) = ie r(ujer™""du. (3.10) 


foe) 


The function r is a test function that replaces the ‘box’ in the statement of the pair 
correlation Conjecture 1. One notable item about Montgomery’s pair correlation 
conjecture is that there is no restriction on the length of the interval [a, b]; the 
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difference b — a is permitted to be arbitrarily small. In the language of smooth 
test functions, this translates to permitting arbitrarily large support on the Fourier 
transform 7. 

If 7(a) € L', then upon multiplying (3.9) by *(a) and integrating, we deduce 


wy) los T T cane 
(oper v-) ~ (= 127) [. (a)F(w)da (3.11) 


O<y,y/<T 


as T tends to infinity. If the Riemann hypothesis is true, the asymptotic (3.11) 
connects the pair correlation of ¢(s) to the function F(a) given in (3.9). Montgomery 
proceeded to prove an important special case of Conjecture | for a class of test 
functions with Fourier transform supported in (—1, 1). 


Theorem 1 (Montgomery’s Theorem). Assume the Riemann hypothesis. For real 
a, T > 2, let F(a) be defined by (3.9). Then F(a) is real, and F(a) = F(—a). If 
T > To(€) then F(a) => —e for all a. For fixed a satisfying 0 < a < 1 we have 


F(a) = a+o(1) +7 ~~ log T(1 + 0(1)) (3.12) 


uniformly for 0 < a < 1 as T tends to infinity. 


Thus, for any function r(v) € L! with Fourier transform 7(@) supported in 
(—1, 1), one can use (3.12) to evaluate the sums appearing in (3.11). Fora > 1, 
Montgomery further conjectured, with heuristic arithmetic justification, that 


F(a) = 1+ o0(1)  uniformlyinboundedintervalsasT — oo. (3.13) 


This conjecture, combined with (3.12) gives a complete picture of the function F(a), 
which led Montgomery to make his pair correlation conjecture. 


3.3. Proof of Montgomery’s Pair Correlation Conjecture 
for Restricted a,b 


We now provide greater detail about Montgomery’s original proof [107, Sect. 3, 
pp. 187-191] of his theorem (Theorem 1). The point of entry is an explicit formula 
due to him. 

The role of explicit formule cannot be overstated when working with ¢(s) 
or L-functions, as these formule unlock the multiplicative structure implicit in 
the Euler product, usually via the argument principal applied to the logarithmic 
derivative. Assuming the Riemann hypothesis, and writing critical zeros of €(s) as 
1/2 + iy and y real, with | <o < 2 andx > 1, Montgomery proved that 
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00-1) . xi 
y 


o—1)+(t-yyP 


= x1? (x A(n) a= 7 3 Abo (3)"") 


n<x 


4 ergs 4510, (1). Onaeor ), (3.14) 


where t = |t| + 2 and the implied constants depend only ono. 


Proof (Proof of Montgomery’s Theorem (Theorem 1); [107, Sect. 3, pp. 187-191]). 
Placing o = 3/2 in (3.14), and letting L(x, t) and R(x, t) denote the left and right 
sides, respectively, we now wish to evaluate the second moments of both sides; i.e. 
i. |L(x, t)|* dt, Se |R(x, t)|? dt. The reason to do this is that, as we will see, F(a) 
falls out of the second moment of the left side, and we end up with something 
tractable for the second moment of the right side. Thus the equation of the two 
moments gives us an identity for F(a). 

By showing the contribution of those ordinates y above height T is O(log? 7), 
Montgomery obtained 


aa a dt 
|L(x,t)|? dt = 4 xv) / + O(log? T). 
| » 0 d+(t—y))C + (t- y’)) 
O<y/<T 


(3.15) 


Note that the range of integration may be extended to all of R at a penalty no greater 
in magnitude than O(log’ T); we then have 


T oo 
es dt 
|L(x, t)|? dt = 4 x0) / +O(log? 7); 
I 2 co 1+ = yA + C=) 
0<y/<T 
(3.16) 
it then follows from the residue calculus that the definite integral evaluates to w(y — 
y')x/2 and 


T 
i |L(x, 1)|? dt = 2x ~~ ¥O-Y) wy — y’) + O(log? 7). (3.17) 
0 O<y<T 
O<y’/<T 


Putting x = T% yields 


T 
/ |L(x, t)|? dt = F(a)T logT + O(log? 7). (3.18) 
0 
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The non-negativity of the left side of (3.18) gives the statement in Theorem | of the 


positivity of F(a). (The evenness of F(a) follows from the fact that y and y’ may 
be interchanged in the definition (3.9).) It then falls to evaluate i |R(x, |? dt. First, 


T 
T 
/ Foe log t| dt = = (log T + O(log T)) (3.19) 
i x 


for all x => 1,7 => 2. Montgomery then applied a quantitative version of Parseval’s 
identity for Dirichlet series to find 


r 


Applying (3.20) to the explicit formula (3.14), we find 


2 
dt = S°|a,|? (T+ O(n). (3.20) 


> } a,nit 


n 


2 


: [ » Me oe * = 5 q@y" 


= . yo Ain) (*)- (T+ O(n) + : So A(n)? (*) (T + O(n)) 


nsx n>X 


= T(logx + O(1)) + O(xlogx), (3.21) 


where the last line follows from the prime number theorem with error term. It then 
follows from simple estimation of the error terms and a more delicate application of 
Cauchy-Schwarz that 


T 
i} IR(T*, t)|’ dt = ((1 + 0(1))T ~ logT + a + o(1))T log T, (3.22) 
0 


uniformly for 0 < a < 1 — e. Combining (3.18) and (3.22) yields Montgomery’s 
theorem. Oo 


We end this section by describing the heuristic evidence that led Montgomery 
to conjecture (3.13) on the behavior of F(a) for a > 1. The argument above for 
proving Montgomery’s conjecture for 0 < a < 1 fails for a > 1, since error terms 
such as in (3.21) and those arising from Cauchy-Schwarz and the last line of (3.14) 
are no longer dominated by the main term. 

Examining the sum over primes from the explicit formula (3.14) with o = 3/2, 


So A(n) cy" + An) Cue (3.23) 
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the expected value is seen by the prime number theorem to be 


Qxlvit 
(3 +i) (3-H) 


From the proof of Montgomery’s theorem we have, with F(x, T) as in (3.8), that 


F(x,T) = sl S> An) Cys Dae ‘Ca 


(3.24) 


nsx n>Xx 
. 2 

Qx1- it 

2 2 
it follows that we would like to know the size of 
Pils Qxl/2-it 2 
| =O Aa?" + x SA)" — ——_____| dt. (3.26) 
0 - nsx n>XxX G ca it) (5 ~ it) 


Montgomery proceeded to multiply out and integrate term-by-term, finding that the 
non-diagonal is non-neglectable. He collected terms in the form of sums of the sort 


SAMA +h); (3.27) 


ny 


invoking the Hardy-Littlewood k-tuple conjecture for 2-tuples with a strong error 
term, (3.27) should be = y. This would give 


T 
FT) ~ 5— log? (3.28) 


inx < T < x?~*, and there is little reason to expect the behavior to change for 
bounded a > 2. On this basis, Montgomery made his conjecture (3.13). 


3.4 Spacings Between Adjacent Zeros 


Motivated by Montgomery’s pair correlation conjecture on the zeros of the Riemann 
zeta function, starting in the late 1970s Andrew Odlyzko began a large-scale 
computation of zeros of ¢(s) high in the critical strip. The average spacing between 
zeros of ¢(s) at height T in the critical strip is on the order of 1/ log T; thus as we go 
higher and higher we have more and more zeros in regions of fixed size, and there 
is every reason to hope that, after an appropriate normalization, a limiting behavior 
exists. 
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The story of computing zeta zeros goes back to Riemann himself. As mentioned 
in Sect. 1.2.2, in his one paper on the zeta function [123], Riemann states the 
Riemann hypothesis (RH) in passing. He used a formula now known as the 
Riemann-Siegel formula to compute a few zeros of €(s) up to a height of probably no 
greater than 100 in the critical strip; though he did not mention these computations 
in the paper, the role of these computations was important in the development 
of mathematics and mirror the role played by the calculation of energy levels in 
nuclear physics in illuminating the internal structure of the nucleus. The formula 
was actually lost for almost 70 years, and did not enter the mathematics literature 
until Siegel was reading Riemann’s works [131]. Siegel’s role in understanding, 
collecting, and interpreting Riemann’s notes should not be underestimated, since 
the expertise and insight needed to infer the ideas behind the notes was great. 

The development of the Riemann-Siegel formula proceeds along the purely 
classical lines of complex analysis. Riemann had a formula for ¢(s) valid for all 
s € C; namely, 


Trd-s) (—x)* dx 
201 ge-1l x 


SC 


, (3.29) 


where @ is the contour that starts at +00, traverses the real axis towards the origin, 
circles the origin once with the positive orientation about 0, and then retraces its 
path along the real axis to +00. 

By splitting off some finite sums from the contour integral above, Riemann 
arrived at the formula 


N M 
s) = i 1/2-—s lr (5) 1 
69) = ag tt Ty La 


n=1 n=1 


T—s) (—x)e"** dx 
2ni ey eI x 


, (3.30) 


where here s € C, N,M € N are arbitrary, and Gy is the contour that traces from 
+oo0 to (2M + 1)z, circles the line |s| = (2M + 1)z once with positive orien- 
tation, and then returns to +00, thereby enclosing the poles +27iM, +27i(M — 
1),..., +2zi, and the singularity at 0. This formula for ¢(s) can be regarded as 
an approximate functional equation, where the remainder is expressed explicitly in 
terms of the contour integral over Gy. The main task in developing the Riemann- 
Siegel formula then falls to estimating the contour integral over Gy using the 
saddle-point method. 

Prior to Siegel’s work, in 1903 Gram showed that the first 10 zeros of f(s) lie on 
the critical line, and showed that these 10 were the only zeros up to height 50. The 
development of the above, along with a cogent narrative of Riemann, Siegel, and 
Gram’s contributions, may be found in Edwards [39]. 

In almost every decade in the last century, mathematicians have set new records 
for computations of critical zeros of ¢(s). Alan Turing brought the computer to 
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bear on the problem of computing zeta zeros for the first time in 1950, when, 
as recounted by Hejhal and Odlyzko [77], Turing used the Manchester Mark 1 
Electronic Computer, which had 25,600 bits of memory and punched its output on 
teleprint tape in base 32, to verify every zero up to height 1540 in the critical strip 
(he found there are 1104 such zeros). Turing also introduced a simplified algorithm 
to compute zeta zeros now known as Turing’s method. Turing published on his 
computer computations and his new algorithm for the first time in 1953 [139]. 

Following Turing, the computation of zeros of f(s) took off thanks to the 
increasing power of the computer. At this time, the first 10!* nontrivial zeros of 
f(s), tens of billions of nontrivial zeros around the 107° and 1074, and hundreds 
of nontrivial zeros near zero number 10°? are known to lie on the critical line. 
Additionally, new algorithms by Schénhage and Odlyzko, and by Schénhage, 
Heath-Brown, and Hiary have sped up the verification of zeta zeros. 

However, the aforementioned projects for numerically checking that zeros of €(s) 
lay on the critical line were not concerned with accurately recording the height 
along the critical line of the zeros computed; only with ensuring the zeros had real 
part exactly 1/2. This changed in the late 1970s with a series of computations by 
Andrew Odlyzko, who was motivated not only by the Riemann Hypothesis but also 
by Montgomery’s pair correlation conjecture. 

Rather than verify consecutive zeros starting from the critical point, Odlyzko was 
interested in starting his search high up in the critical strip, in the hope that near zero 
number 10!7, the behavior of £(s) would be closer to its asymptotic behavior. For, 
as Montgomery’s pair correlation conjecture is a statement about the limit as one’s 
height in the critical strip passes to infinity, one would wish to know the ordinates 
of many consecutive zeta zeros in the regime where ¢(s) is behaving asymptotically 
if one wished to test the plausibility of the conjecture. 

As he explains [110], his first computations [109] were in a window around zero 
number 10!*, and were done on a Cray supercomputer using the Riemann-Siegel 
formula. These computations motivated Odlyzko and Arnold Schénhage to develop 
a faster algorithm for computing zeros [111, 114], which was implemented in the 
late 1980s and was subsequently used to compute several hundred million zeros near 
zero number 102° and some near number 2 - 10”°, as seen in [112, 113]. 


3.5 Number Theory and Random Matrix Theory Successes 


After its introduction as a conjecture in the late 1950s to describe the energy levels 
of heavy nuclei, random matrix theory experienced successes on both the numerical 
and the experimental fronts. The theory was beautifully developed to handle a large 
number of statistics, and many of these predictions were supported as more and 
more data on heavy nuclei became available. While there was significant theoretical 
progress (see, among others, [37, 38, 57, 98, 100, 143—148]), there were some gaps 
that were not resolved until recently. For example, while the density of normalized 
eigenvalues in matrix ensembles (Wigner’s semi-circle law) was known for all 
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ensembles where the entries were chosen independently from nice distributions, the 
spacings between adjacent normalized eigenvalues resisted proof until this century 
(see, among others, [41, 42, 137, 138]). 

The fact that random matrix theory also had a role to play in number theory 
only emerged roughly twenty years after Wigner’s pioneering investigations. The 
cause of the connection was a chance encounter between Hugh Montgomery and 
Freeman Dyson at the Institute for Advanced Study at Princeton. As there are now 
many excellent summaries and readable surveys of their meeting, early years and 
statistics (see in particular [73] for a Hollywoodized version), and the story is now 
well known, we content ourselves with a quick summary. For more, see among 
others [20, 21, 31, 32, 82, 84-86, 106]. 

As described in Sect. 3.1, Montgomery was interested in the class number, which 
led him to study the pair correlation of zeros of the Riemann zeta function. Given an 
increasing sequence {a,}°°, and a box B C R", the n-level correlation is defined by 


n=1 


lim # {(qj, ~ Ajgy e+ + 5 bj — Qj, ) € Bi # Ind (3.31) 
Noo N 


the pair correlation is the case n = 2, and through combinatorics knowing all the 
correlations yields the spacing between adjacent events. Montgomery was partially 
able to determine the behavior for the pair correlation. When he told Dyson his 
result, Dyson recognized it as the pair correlation function of eigenvalues of random 
Hermitian matrices in a Gaussian Unitary Ensemble, GUE. 

This observation was the beginning of a long and fruitful relationship between 
the two areas. At first it appeared that the GUE was the only family of matrices 
needed for number theory, as there was remarkable universality seen in statistics. 
This ranged from work by Hejhal [76] on the 3-level correlation of the zeros of ¢(s) 
and Rudnick and Sarnak [127] on the n-level correlation of general automorphic 
L-functions, to Odlyzko’s [109, 110] striking experiments on spacings between 
adjacent normalized zeros. In all cases the behavior agreed with that of the GUE. 

In particular, Odlyzko’s computations of high zeta zeros showed that, high 
enough along the critical line, the empirical distribution of nearest-neighbor spac- 
ings for zeros of ¢(s) becomes more or less indistinguishable from that of 
eigenvalues of random matrices from the Gaussian Unitary Ensemble, or GUE. 
The agreement with the first million zeros is poor, but the agreement near zero 
number 10!? is close, near perfect near zero number 10!°, and even better near zero 
number 107°. These results provide massive evidence for Montgomery’s conjecture, 
and vindicate Odlyzko’s choice of starting his search high along the critical line; 
see Fig. 5. 

In all of these investigations, however, the statistics studied are insensitive to the 
behavior of finitely many zeros. This is a problem, as certain zeros of L-functions 
play an important role. The most important of these are those of elliptic curve 
L-functions. Numerical computations on the number of points on elliptic curves 
modulo p led to the Birch and Swinnerton-Dyer conjecture. Briefly, this states that 
the order of vanishing of the L-function at the central point equals the geometric 
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Fig. 5 Probability density of the normalized spacings 6,,. Solid line: GUE prediction. Scatterplot: 
empirical data based on Odlyzko’s computation of a billion zeros near zero #1.3 x 10!°. (From 
Odlyzko [110, Fig. 1, p. 4]) 


rank of the Mordell-Weil group of rational solutions. The theorems on n-level 
correlations and spacings between adjacent zeros are all limiting statements; we 
may remove finitely many zeros without changing these limits. Thus these quantities 
cannot detect what is happening at the central point. 

Unfortunately for those who were hoping to distinguish between different 
symmetry groups, Katz and Sarnak [84, 85] showed in the nineties that the n-level 
correlations of the scaling limits of the classical compact groups are all the same 
and equal that of the GUE. Thus when we were saying number theory agreed with 
GUE we could instead have said it agreed with unitary, symplectic or orthogonal 
matrices. 

This led them to develop a new statistic that would be sensitive to finitely 
many zeros in general, and the important ones near the central point in particular. 
The resulting quantity is the n-level density. We assume the Generalized Riemann 
Hypothesis (GRH) for ease of exposition, so given an L(s,f) all the zeros are of the 
form 1/2 + iy;.¢ with y;.¢ real. The statistics are still well-defined if GRH fails, but 
we lose the interpretation of ordered zeros and connections with nuclear physics. 
For more detail on these statistics see the seminal work by Iwaniec et al. [83], who 
introduced them (or [2] for an expanded discussion). 

Let @; even Schwartz functions such that the Fourier transforms 


$0) = / dj(xe 7" dx (3.32) 
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are compactly supported, and set ¢(x) = TTja1 (xj). The n-level density for f with 
test function ¢ is 


Dilf.6) = Yo br (Levis) ++ On (Ly Vint) (3.33) 
een 
where Ly is a scaling parameter which is frequently related to the conductor. Given 
afamily F = Uy¥y of L-functions with conductors tending to infinity, the n-level 
density D,(.F,¢,w) with test function ¢ and non-negative weight function w is 
defined by 


D,(F,¢,w) = lim DpeGy PP uf) (3.34) 
Noo » fe Fn w(f) 

Katz and Sarnak [84, 85] conjecture that as the conductors tend to infinity, the n- 
level density of zeros near the central point in families of L-functions agree with the 
scaling limits of eigenvalues near | of classical compact groups. Determining which 
classical compact group governs the symmetry is one of the hardest problems in the 
subject, though in many cases through analogies with a function field analogue one 
has a natural candidate for the answer, arising from the monodromy group. Unlike 
the n-level correlations, the different classical compact groups all have different 
scaling limits. As the test functions are Schwartz and of rapid decay, this statistics 
is sensitive to the zeros at the central point. While it was possible to look at just one 
L-function when studying correlations, that is not the case for the n-level density. 
The reason is that while one L-function has infinitely many zeros, it only has a 
finite number within a small, bounded window of the central point (the size of the 
window is a function of the analytic conductor). We always need do perform some 
averaging; for the n-level correlations each L-function gives us enough zeros high 
up on the critical line for such averaging, while for the n-level density we must move 
horizontally and look at a family of L-functions. While the exact definition of family 
is still a work in progress, roughly it is a collection of L-functions coming from a 
common process. Examples include Dirichlet characters, elliptic curves, cuspidal 
newforms, symmetric powers of GL(2) L-functions, Maass forms on GL(3), and 
certain families of GL(4) and GL(6) L-functions; see for example [2, 3, 35, 36, 40, 
46, 53, 55, 67, 79, 80, 83, 85, 94, 101, 105, 116, 117, 122, 125, 126, 151, 152]. This 
correspondence between zeros and eigenvalues allows us, at least conjecturally, to 
assign a definite symmetry type to each family of L-functions (see [36, 130] for 
more on identifying the symmetry type of a family). 

There are many other quantities that can be studied in families. Instead of looking 
at zeros, one could look at values of L-functions at the central point, or moments 
along the critical line. There is an extensive literature here of conjectures and results, 
again with phenomenal agreement between the two areas. See for example [22]. 
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4 Future Trends and Questions in Number Theory 


The results above are just a small window of the great work that has been done 
with number theory and random matrix theory. Our goal above is not to write a 
treatise, but to quickly review the history and some of the main results, setting the 
stage for some of the problems we think will drive progress in the coming decades. 
As even that covers too large an area, we have chosen to focus on a few problems 
with a strong numeric component, where computational number theory is providing 
the same support and drive to the subject as experimental physics did years before. 
There are of course many other competing models for L-functions. One is the Ratios 
Conjectures of Conrey et al. [23, 24, 26]. Another excellent candidate is Gonek, 
Hughes and Keating’s hybrid model [65], which combines random matrix theory 
with arithmetic by modeling the L-function as a partial Hadamard product over the 
zeros, which is modeled by random matrix theory, and a partial Euler product, which 
contains the arithmetic. 

In all of the quantities studied, we have agreement (either theoretical or exper- 
imental) of the main terms with the main terms of random matrix theory in an 
appropriate limit. A natural question to ask is how this agreement is reached; in other 
words, what is the rate of convergence, and what affects this rate? In the interest of 
space we assume in parts of this section that the reader is familiar with the results 
and background material from [83, 127], though we describe the results in general 
enough form to be accessible to a wide audience. 


4.1 Nearest Neighbor Spacings 


We first look at spacing between adjacent zeros, where Odlyzko’s work has shown 
phenomenal agreement for zeros of €(s) and eigenvalues of the GUE ensemble. 
We plot the difference between the empirical and ‘theoretical, or ‘expected’ GUE 
spacings in Fig. 6. In his paper [110], Odlyzko writes: Clearly there is structure in 
this difference graph, and the challenge is to understand where it comes from. 
Recently, compelling work of Bogomolny et al. [12] provides a conjectural 
answer for the source of the additional structure in the form of lower-order terms 
in the pair correlation function for ¢(s). Though the main term is all that appears in 
the limit (where Montgomery’s conjecture applies), the lower-order terms contribute 
to any computation outside the limit, and would therefore influence any numerical 
computations like those of Odlyzko. By comparing a conjectural formula for the 
two-point correlation function of critical zeros of ¢(s) of roughly height 7 due 
to Bogomolny and Keating in [13] with the known formula for the two-point 
correlation function for eigenvalues of unitary matrices of size N, Bogomolny et. 
al. deduce a recipe for picking a matrix size that will best model the lower-order 
terms in the two-point correlation function, and conjecture that it will be the best 
choice for all correlation functions, and therefore the nearest-neighbor spacing. 
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Fig. 6 Probability density of the normalized spacings 6,. Difference between empirical distribu- 
tion for a billion zeros near zero #1.3 x 10!°, as computed by Odlyzko, and the GUE prediction. 
(From Odlyzko [110, Fig. 2, p.5]) 


More recently yet, Duefiez et al. [33, 34] have applied techniques of Bogomolny 
et. al. and others to studying lower-order terms in the behavior of the lowest zeros 
of L-functions attached to elliptic curves. Their results are currently being extended 
to other Z-functions by the first and third named authors here and their colleagues. 


4.2 n-Level Correlations and Densities 


The results of the studies on spacings between zero suggest that, while the arithmetic 
of the L-function is not seen in the main term, it does arise in the lower order terms, 
which determine the rate of convergence to the random matrix theory predictions. 
Another great situation where this can be seen is through the n-level correlations and 
the work of Rudnick and Sarnak [127]. They proved that the n-level correlations 
of all cuspidal automorphic L-functions L(s, 2) have the same limit (at least in 
suitably restricted regions). Briefly, the source of the universality in the main term 
comes from the Satake parameters in the Euler product of the L-function, whose 
moments are the coefficients in the series expansion. In their Remark 3 they write 
(all references in the quote are to their paper): 


The universality (in z) of the distribution of zeros of L(s, 2) is somewhat surprising, the 
reason being that the distribution of the coefficients a, (p) in (1.6), as p runs over primes, is 
not universal. For example, for degree-two primitive L-functions, there are two conjectured 
possible limiting distributions for the a,(p)’s: Sato-Tate or uniform distribution (with a 
Dirac mass term). As the degree increases, the number of possible limit distributions 
increases rapidly. However, it is a consequence of the theory of the Rankin-Selberg 
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L-functions (developed by Jacquet, Piatetski-Shapiro, and Shalika for m > 3) that all these 
limiting distributions have the same second moment (at least under hypothesis (1.7)). It is 
the universality of the second moment that is eventually responsible for the universality in 
Theorems 1.1 and 1.2. For the case of pair correlation (n = 2), this is reasonably evident; 
for n > 2 it was (at least for us) unexpected, and it has its roots in a key feature of “diagonal 
pairings” that emerges as the main term in the asymptotics of R,(T,f,h). 


Similar results are seen in the n-level densities. There we average the Satake 
parameters over a family of Z-function, and in the limit as the conductors tend to 
infinity only the first and second moments contribute to the main term (at least under 
the assumption of the Ramanujan conjectures for the sizes of these parameters). 
The first moment controls the rank at the central point, and the second moment 
determines the symmetry type (see [36, 130]). For example, families of elliptic 
curves with very different arithmetic (complex multiplication or not, or different 
torsion structures) have the same limiting behavior but have different rates of 
convergence to that limiting behavior. This can be seen in terms of size one over 
the logarithm of the conductor; while these terms vanish as the conductors tend 
to infinity, they are present for finite values. See [102, 104] for several examples 
(as well as [96], where interesting biases are observed in lower order terms of the 
second moments in the families). 


4.3 Conclusion 


The number theory results above may be interpreted in a framework similar to that 
of the Central Limit Theorem. There, if we have ‘nice’ independent identically 
distributed random variables, their normalized sum (standardized to have mean zero 
and variance 1) converges to the standard normal distribution. The remarkable fact is 
the universality, and that the limiting distribution is independent of the shape of the 
distribution. We quickly review why this is the case and then interpret our number 
theory results in a similar vein. 

Given a distribution with finite mean and variance, we can always perform a 
linear change of variables to study a related quantity where now the mean is zero 
and the variance one. Thus, the first moment where the shape of the distribution is 
noticeable is the third moment (or the fourth if the distribution is symmetric about 
the mean). In the proof of the Central Limit Theorem through moment generating 
functions or characteristic functions, the third and higher moments do not survive in 
the limit. Thus their effect is only on the rate of convergence to the limiting behavior 
(see the Berry-Esseen theorem), and not on the convergence itself. 

The situation is similar in number theory. The higher moments of the Satake 
parameters (which control the coefficients of the Z-functions) again surface only in 
terms which vanish in the limit, and their effect therefore is seen only in the rate of 
convergence. 
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This suggests several natural questions. We conclude with two below, which we 
feel will play a key role in studies in the years to come. These two questions provide 
a nice mix, with the first related to the main term and the second related to the rate 
of convergence. 


¢ Is Montgomery’s pair correlation true for all boxes (or test functions)? 
What about the n-level correlations, both for ¢(s) and cuspidal automorphic 
L-functions? Note agreement with random matrix theory for all these statistics 
implies the conjectures on spacings between adjacent zeros. 

¢ For a given L-function (if we are studying n-level correlations) or a family of 
L-functions (if we are studying n-level densities), how does the arithmetic enter? 
Specifically, what are the possible lower order terms? How are these affected by 
properties of the L-functions? If we use Rankin-Selberg convolution to create 
new L-functions, how is the arithmetic of the lower order terms here a function 
of the arithmetic of the constituent pieces? 


There are numerous resources and references for those wishing to pursue these 
questions further. For the n-level correlations, the starting point are the papers 
(76, 107, 127], while for the n-level densities it is [83-85]. 
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The Generalized Fermat Equation 


Michael Bennett, Preda Mihailescu, and Samir Siksek 


Abstract We survey approaches to solving the generalized Fermat equation 
x? + y! = Zz 


in relatively prime integers x, y and z, and integers p,q andr > 2. 


1 Le Roy est mort: Vive le Roy 


Pythagoras’ formula was purportedly kept secret within the closed circle of his 
initiates—but, as with any fact of nature, it became eventually widely known: 
the squares of the cathetes sum to the square of the hypotenuse. In the spirit 
of arithmetic, spread eight centuries later by Diophantus of Alexandria, one may 
instead phrase this statement in terms of integral solutions of the equation 


r+ y =, withx, y,z € N and gcd(x, y, z) = 1, (1) 


or, equivalently, ask for the coordinates of all rational points on the unit circle. All 
these are variants of the problem appearing in the second book of Diophantus, in 
the chapter numbered VUI—often quoted accordingly as Diophantus II. VIII. We 
nowadays call the solutions to Eq. (1) Pythagorean triples. Already in Diophantus 
one finds parametrizations for these solutions, given by 


M. Bennett 

Department of Mathematics, University of British Columbia, Vancouver, 
BC, Canada V6T 1Z2 

e-mail: bennett @math.ubc.ca 


P. Mihailescu (J) 
Mathematisches Institut der Universitat Gottingen, Géttingen, Germany 
e-mail: preda@uni-math.gwdg.de 


S. Siksek 
Mathematics Institute, University of Warwick, Coventry CV4 7AL, UK 
e-mail: s.siksek @ warwick.ac.uk 


© Springer International Publishing Switzerland 2016 173 
J.F. Nash, Jr., M.Th. Rassias (eds.), Open Problems in Mathematics, 
DOI 10.1007/978-3-319-32162-2_3 


174 M. Bennett et al. 


x = 2uv; y=uv—-v’; z=uw+v’, (2) 


where u and v are positive integers. This fact is easy to verify in one direction; the 
proof that all triples are parametrized in this form is one of the most popular of 
ancient mathematics and is widely taught to this day. 

The work of Diophantus on Arithmetic, a collection of 12 volumes written in 
Greek, was lost for centuries. Only in the late sixteenth century were half of these 
books, namely J-HI and VIII—X, rediscovered in the form of a later Byzantine 
transcription. These were translated into latin by Bombelli, and then subsequently 
published in Basel by Xylander. It was through the annotated translation of Gaspard 
Bachet from 1621 that the Arithmetica finally received a wide diffusion, capturing 
the interest of mathematicians of the epoch. Among them, Fermat, a lawyer from 
Toulouse, was particularly impressed by the beauty of Diophantus’ solution to (1). 
It is on the margin of the text to Diophantus’ Problem II. VIII that Fermat wrote in 
1634 his historical note concerning a short proof of the fact that the equation 


x" +y" =z", withx,y,z € Nand gced(x, y,z) = 1, (3) 


has no solution for n > 2, a proof which the margin of Bachet’s book was 
insufficiently large to contain. 

The assertion, henceforth bearing the name Fermat’s Last Theorem—hereafter 
denoted FLT for concision—remained an open problem for more than three 
centuries. Attempts to prove FLT led to some of the most significant developments 
in mathematics of the past three hundred years; it is fair to say that it is one of the 
problems that has generated the most mathematics in history. The first systematic 
approach, initiated by Kummer in the mid nineteenth century, was based on the 
theory of algebraic number fields and in particular cyclotomic fields. The conjecture 
was finally proved in 1994 by Wiles, with the help of Taylor, building on a series 
of ideas and results, due to Hellegouarch, Frey, Serre and Ribet, that connect the 
Fermat equation to elliptic curves, modular forms and Galois representations. 

Even before Wiles announced his proof, various generalizations of Fermat’s Last 
Theorem had already been considered, to equations of the shape 


Ax? + By? = Cz’, 


for fixed integers A, B and C. In the case where A = B = C = 1, for reasons we 
discuss later, we focus our attention on the equation 


1 1 1 
xv +yl=27', withx,y,zeN, ged(x,y,z)=land-+-+-<1l. (4) 
Pp qe 


Perhaps the only solutions to this equation are those currently known; i.e. those with 
(x, y, Z, p,q.) coming from the solution to Catalan’s equation 1? + 2? = 37, and 
from the following nine identities: 
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P4+P=3*, P4+13?=29, 27417 =71, 324114 = 122?, 
177 + 762713 = 210639282, 14143 + 2213459? = 657, 92623 + 15312283? = 1137, 
438 + 96222? = 30042907? and 33° + 15490347 = 15613°. 


In the mid 1990s, Andrew Beal, a graduate in mathematics with computational 
interests, in the course of carrying out calculations related to Fermat’s equation 
and its variations, noted that the solutions listed here all have the property that 
min{p, q,r} = 2 and made what is now termed the Beal conjecture, that there are 
no non-trivial solutions to (4), once we assume that min{p,q,r} > 3. A prize for 
the solution to this problem now amounts to one million U.S. dollars. Other names 
attached to conjectures about Eq. (4) include the Fermat—Catalan conjecture and the 
Tijdeman—Zagier conjecture. 

As we shall see, the few particular cases of these conjectures that have been 
investigated may well support the hope that this new generalization of Pythagoras’ 
initial equation will also stimulate fascinating new mathematics. 

Our main focus for the rest of the paper will be to describe two approaches to 
proving results about Eq. (4). The first of these is essentially a generalization of the 
cyclotomic methods that proved successful for Catalan’s equation. The second is the 
adaptation of Wiles’ proof of Fermat’s Last Theorem to handle many special cases 
of Eq. (4). 


2 Cyclotomic Approaches and Their Limitations 


Let us start by noticing that in both Eqs. (3) and (4) one may assume all exponents 
to be prime: indeed, if there is a solution with non-prime exponents, by raising 
the variables to some power, one obtains a solution with prime exponents. It thus 
suffices to consider this case and show that it leads to no non-trivial solutions. By an 
elementary observation, sometimes attributed to Euler, we have that 


xP + yP 
x+y 


G := gcd (: +y, 
is a divisor of p, provided that the integers x, y are coprime.! In this section, we will 
focus our attention on the special case of (4), given by the Fermat-Catalan equation 

x+y? = 27, withx,y,z € Nand gcd(x,y,z) = 1, (5) 


where one may hope to apply cyclotomic theory in a way somewhat analogous to 
that of the Fermat equation. Equation (5) also serves as a generalization of the binary 
Catalan equation 


vY-yl=l (6) 


'One verifies this by letting t = x + y and x = t + y, as well as using the fact that (t, y) = 1. 
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which also resisted solution for more than a century (and was finally solved by the 
second author, eight years after Wiles’ remarkable proof of Fermat’s Last Theorem). 
Here binary refers to the presence of only two unknowns in this equation, a fact 
which facilitates an analytic approach to bounding the possible solutions. 

Assuming that either (3) or (5) has a non-trivial solution for the prime p—or 
the prime pair (p, q)—one distinguishes the cases when G = 1 and G = p. The 
case where p + xyz has traditionally been termed the First Case of FLT, or FLT1; we 
retain this designation for the Fermat—Catalan equation. If G = p and thus p | xyz, 
one may assume, in Eq. (3) at least, that p | z. It is further easy to show that, in this 
case, 


x? + yP 
xty 


pt 


Since the Fermat equation is homogeneous, the assumption that x,y and z are 
pairwise coprime is straightforward—otherwise one could divide by the common 
divisor in a solution and obtain one in coprime integers. In the case of (5), however, 
with a little work, one can always construct infinitely many “trivial” solutions for 
which x, y and z fail to be coprime. 


2.1 History of “Fermat’s Last Theorem” 


As is well known, Fermat left no published proof of his conjecture. He did, however, 
provide a beautiful argument in the biquadratic or quartic case. To be precise, he 
considered the more general equation 


etya7 (7) 


and, using some astute manipulation of Pythagorean triples, proved that if (x, y, z) is 
a non-trivial solution of (7) in, say, positive integers, then one can construct a further 
positive solution (x’, y’, z’) of the same equation, in which z’ < z. By repeating the 
procedure one eventually finds a solution with z’ = 1, which implies that x’-y’ = 0,a 
contradiction. This was the first instance of the method of infinite descent in number 
theory. Euler gave a proof of the cubic case using such an argument; Gauss gave later 
an alternative proof using congruences. Although elementary, both methods require 
quite intricate computations and are not easy to memorize. We provide here a short 
elementary proof, which uses some more recent ideas, that date back to Wieferich 
and Furtwangler’: 


The second author found this proof, confronted with the own incapacity to recall the details of the 
classical proofs, for a seminar. It is possible that the proof may have been known, but we found no 
reference to it in the literature 
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Lemma 1. The equation x* + y? + 2 = 0 has no non-trivial integer solutions. 
Proof. Assume that (x, y, z) is a non-trivial solution in coprime integers. Let 


14+ ¥33 
2 


be a third root of unity and & = Z[p] be the ring of Eisenstein integers, which is a 
Euclidean ring. We assume first that 3 + xyz, so that both x + y, and x? — xy + y’ are 
cubes, say x + y = s°, and 


(x + py) = (a+ bp)? 


is the cube of a principal ideal. Exactly one of x, y, z must be even—we shall thus 
assume that y = 2v is even. Since the units of & are the sixth roots of unity, there is 
some 6 €< —p > such that 


S(x + yp) = (a+ bp) = a° + b? + 3abp(a + bp) 
=a +b’ —3ab* + 3ab(a— b)p. 


If 6 = +1, then comparing coefficients implies that y = 0 mod 3, contradicting our 
assumption. We thus have that 


x+typ=+p*(a+ bp), withe = +1. 
We have chosen y = 0 mod 2, whereby xp~° = w> mod 2& for some w = +(a + 
bp) € &. Butx = x + y = s?° mod 2, whence we may conclude that 


p ° = (w/s) mod 2. 


The ideal p := (2) C @& is prime; let m : & — F 2 be the natural projection. Since 
c # 0 mod 3, it follows that z(w/s) € F is a primitive ninth root of unity, an 
impossibility. It remains, then, to consider the case where 3 | xyz; we may suppose, 


without loss of generality, that 3 | z. Appealing to (9), we find that there is a root of 
unity 6 such that (aw) = 6 (2) = (a + bp)’ and x + y = 9s°. Since ie = -/, 
we obtain after dividing the previous identity by its complex conjugate, that there is 


another root of unity, say 6’, such that 


3 


2x-y+yV—3 b 
a acl Mel =9.( 2) = 8’ mod 3V-3-&. (8) 
X+yp 2Ix—-—y—yvV/-3 a+ bp 
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If 2x 4 y mod 3, letting y’ = y/(2x — y) mod 3, the last identities imply that 


1+yV-3 _ 


———— = 5' mod 3V-3-@, 
1—-yJ/-3 


whence 
y =y=0mod3, 


contradicting the assumption that 3 | z and gcd(x, y,z) = 1. Finally, suppose that 
2x — y = —3d. Inserting this value into (8) yields 


d./—3 
ON = ed aoe: 
d/—3-—y 


whereby 6’ = —1 and d = 0 mod 3. Then 2x — y = x + y = 0 mod 9. Summing 
these two congruences, we find 3x = 0 mod 9, and thus 3 | x, a contradiction which 
completes the proof. 

oO 
After Euler and Gauss, the quintic and septic cases of FLT were solved with 
contributions from Dirichlet, Lamé, Legendre and Cauchy. In his proof of the quintic 
case, Dirichlet distinguished the cases p + xyz and p | xyz, using descent for the 
proof of the second case. Lamé announced in 1841 a full proof of the general case 
of FLT. Unfortunately, his proof relied implicitly upon the (incorrect) assumption 
that the integers of the form 


p-2 
a= Dail 
k=0 


with a, € Zand ¢ a pth root of unity, form a ring with unique factorization. Kummer 
demonstrated that this assumption is, in general, false and that the smallest prime for 
which it fails is p = 23. In order to circumvent this difficulty, Kummer proceeded 
with an investigation of divisibility in the rings of algebraic integers of the pth 
cyclotomic field, and introduced the notion of ideal numbers, a larger group in which 
unique factorization was recovered. This work, stemming from a desire to attack the 
Fermat problem, led, in the following decades, to the theory of ideals and the work 
of Dedekind, giving rise to the fundamental results underlying what we presently 
know as algebraic number theory. 

If p is an arbitrary odd prime, we let K = QJ[€] denote the pth cyclotomic 
extension. The algebraic integers of this field are O(IK) = Z[¢] and the ideals of 
this ring factorize uniquely as products of prime ideals. If J is the semigroup of 
ideals and P the one of principal ideals, i.e. ideals generated by single elements, 
the quotient @(K) = //P is a finite abelian multiplicative group, the class group. 
The prime p is called regular, if p does not divide the size h(K) = |@(K)| of the 
class group, and irregular otherwise. With respect to the Fermat equation, we have 
in Z[C] one of the following factorizations: 
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p—1 
+ = ; 
xP =(x+y)- =(x+y)- x+yl°), or 
e+y-(AZr)-e+n- Tere 
vay (att 
z = pts +»)-(S A) = pty): ( ). 
p(x + y) li ae aa’ 
in case of FLT1 or FLT2, respectively. In either situation, writing a = 2+&, with 


(i—¢)°” 
e = Oin the first case and e = | in the second, one finds that a is coprime to p(x+y) 


and is, in fact, a pth power, albeit not one of an algebraic integer of K, but rather of 
the ideal 2( = (a, z). We thus have 


ze 


(a) = 2? and Nx/o(@) = peaty) 


(9) 


We note at this point that, in the case of the Fermat-Catalan equation, the same 
construction leads again to (w) = 2%. Using the connection between class groups 
and factorization of ideals, Kummer proved his fundamental result on Fermat’s 
Conjecture: 


Theorem 1 (Kummer). Equation (3) has no solutions for regular primes p > 2. 


For regular primes, it follows from (9) that there exists a p € Z[¢] such that 
(a) = (p)? is the pth power of a principal ideal. Starting from this, the proof of 
FLT1 is relatively simple. For the second case, however, Kummer appealed to a 
sophisticated variant of infinite descent in the real field Kt C K—the method bears 
currently his name, Kummer descent. A modern, complete proof of this result can 
be found in the book of Washington [36], Chapter IX. One finds in Rassias’ lovely 
introductory work for undergraduates [26], on page 147, more biographical details 
of Kummer’s life. 

Kummer’s work was followed by a century of active research on the Fermat 
equation, which led to the establishment of a large number of conditions known to 
imply the truth of the Fermat Conjecture—see e.g. Ribenboim’s famous survey [27]. 
However, before Wiles’s breakthrough, there were only two known results known 
to hold for infinitely many exponents, namely the “elementary” proof [34] given by 
Terjanian in 1977 for the fact that (3) has no solution for even exponents n > 2 with 
n + xy, as well as the deep analytic proof of Adelman, Fouvry and Heath-Brown, 
which showed that FLT 1 holds for infinitely many primes.* 


3One would expect, for various reasons, that regular primes occur more frequently than irregular 
ones. If we knew this, since it has been proved that there are infinitely many irregular primes, 
Kummer’s result would already imply that there are infinitely many primes p for which FLT holds. 
However no proof of the fact that the set of regular primes is infinite is known, even now. 
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There are, however, a number of results of interest on FLT that were established 
via cyclotomic methods before Wiles’ proof. We review here a few of the most 
important of them: 


(i) Wieferich and then Mirimanoff and Furtwangler proved that if FLT1 has a non- 
trivial solution, then 


a’-'=1modp’, fora € {2,3}. (10) 


Variations on this theme were treated during the following decades by, among 
others, Morichima, Lehmer, Skula, Granville and Monagan, the last two of 
these eventually proving that if FLT1 had non-trivial solutions, then (10) holds 
for all primes a < 89. With this Granville and Monagan were able to prove 
that FLT1 has no solutions for 


p < 714,591, 416, 091, 389. 


(ii) Eichler proved that if FLT1 has non-trivial solutions, then the p-rank of the 
p-part of the class group A := @(QlC]), of the pth cyclotomic field is 
necessarily large, namely p—rk(A,) = ./p — 1. He thus improved upon an 
earlier result of Krasner, who had proved that if FLT1 had solutions and p > 
no = (45!)88, then the Bernoulli numbers By-1-2;,4 = 1,2,..., | (log p)!/3 | 
had numerators divisible by p; this implies in particular that p—rk(A,) = 
[dog p)'/?]. 

(iii) In the first half of the twentieth century, Harry Schulz Vandiver wrote 
extensively on Fermat’s equation, partially fixing some gaps in earlier proofs 
of Kummer (and leaving a number of gaps himself, which were fixed only at 
the end of the century). We present below his main result as part of Theorem 2. 


Bearing in mind the fact that the Fermat Conjecture has been proved, it is still 
of interest to analyze other approaches which may provide alternative proofs of 
this Theorem. There are currently two primes known, for which the Wieferich 
condition (10) is satisfied with a = 2; none are known with a = 3. If one admits 
the heuristic assumption that the vanishing of a Fermat quotient 


p-\ 


a 
$p(a) = ———— mod p 
P 


has probability 1/p, one may expect on average O(log log(X)) primes p < X for 
which the quotient g,(a) = 0 for some fixed a < p. However, the same heuristic 
argument suggests that one can find at most one prime for which two or more Fermat 
quotients vanish simultaneously. One may formulate the following: 


Conjecture I. There exists a constant c > 2 such that for every prime p € N there 
are less than c values a € {2,3,...,p — 2} with g,(a) = 0. 


If c < 87, this conjecture implies FLT1. 
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Concerning the criteria in group (ii), Washington provides in [36] heuristic 
arguments suggesting that 


p—tk(Ap) « O(log(p)) 


for all primes. The Theorem of Eichler would imply FLT1 even if a rather weaker 
conjecture holds: 


Conjecture 2. There is an integer a > 2 such that for all primes p, the p part of the 
class group of the pth cyclotomic field has rank p—rk(A,) < ple. 


We have mentioned above that there are infinitely many irregular primes. The 
first of these were discovered by Kummer, the smallest one being p = 37. However, 
if one instead considers the largest real subfield contained in K, which is Kt = 
Q[¢ + ¢], the class number h* of this field is apparently much smaller and seems 
to never be divisible by p. Kummer was the first to suggest, in a letter to Kronecker 
from 1852 (see [23]), that this fact might hold for all p. The fact played an important 
role in many of the papers of Vandiver, who was seemingly unaware of Kummer’s 
letter. The assumption that p + hy = |@(K*)|, is therefore called the Kummer- 
Vandiver Conjecture, or simply the Vandiver Conjecture. The conjecture has also 
deep implications in K-theory; it has been verified numerically for all primes 
p < 27’. We should mention, however, that there are specialists who accept some 
heuristic arguments which suggest that the Conjecture might have counterexamples 
that are as scarce as the Wieferich primes. If this were true, there might be as many 
as log log(X) primes p < X for which the conjecture is false. Those who believe 
the Vandiver Conjecture are guided by the fact that there are numerous striking and 
rather improbable consequences to p | he and therefore the heuristic assumption 
that the value of the residue i mod p is uniformly distributed may be false. 

In the context of Fermat’s Last Theorem, two additional conditions of a rather 
specialized nature play a role; we formulate them also as assumptions: they have 
been computationally verified to hold within the same range as the Kummer— 
Vandiver Conjecture. 


Assumption C. Assume that the exponent of A, is p, whereby the p-part of the 
class group of the pth cyclotomic field is annihilated by p. 

Assumption D. Assume that all the units 6 € Z[¢, + oir for which there exists an 
algebraic integer p € Z[¢, + eI such that 6 = p” mod p?Z[é, + oh] are global pth 
powers. 


With these definitions, the following theorem holds: 


Theorem 2. Suppose that the Kummer—Vandiver Conjecture holds. If, additionally, 
Assumption C holds, then FLT1 has no solutions. If, instead, Assumption D holds, 
then FLT2 has no solutions. 


The first of these claims dates back to Vandiver, who, however supposed only that 
pt ie and had not noticed the necessity of Assumption C. The correct statement 
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was discovered by Sitaraman [32] in 1995. The second part of the theorem, together 
with its proof, are due to Kummer. We note that Theorem 2 provides the only known 
cyclotomic criterium which implies that there are no solutions to FLT2. 


2.2 The Catalan Equation 


Due to results of Victor Lebesgue (1853) and Chao Ko (1962), which eliminated 
the cases of even exponents, the Catalan Conjecture was reduced to proving that 
x? — yf = | has no solution with odd prime exponents (which are easily seen to 
be distinct). By considering cyclotomic factorizations, similar to the ones in (9), 
one obtains four cases. Cassels proved in 1962 that if (6) has a solution with odd 
exponents, then p | y and q | x, while 


xP — 1 


— vt 
p@— 1) 


for some rational integer v. One may define 


x= 
1=¢ 


and 2 = (a@,y), 


whence the analogue of Eq. (9) is (~) = 2. 

Catalan’s conjecture now follows from a combination of analytic methods with 
algebraic properties of cyclotomic fields. We present a brief exposition of some 
of the ideas that made this proof possible. Denoting by G the Galois group 
Gal(Q[Z,]/Q), one notices that the group ring F,[G] acts on the class a of the ideal 
1; in other words, linear combinations of the type 6 = )0,¢GNo - 0, in which the 
integers ng are identified with their remainders modulo gq, will act on the class a 
according to a° = [],¢go(a)"’. Since a! = 1, we see that it suffices to consider 
Ng € Fy. We call an element 6 € F,[G] an annihilator of a if a° = 1. 

Suppose that we are able to find a non-trivial annihilator (1 + j)@ € F,[G]— 
here we denote the restricted action of complex conjugation to K by j € G, so for 
instance a = @. Then a’ = (p)1, for some p € Kt; the equality between principal 
ideals translates into an identity between algebraic numbers, involving an unknown 
unit e: a = ¢- p?. Assume additionally, that there is a further 6’ € F,[G] such that 
ef € (IK*)4. Given this, one is able to prove, using additional arguments about the 
structure of units in K, that there is a number v € Z[f+ ¢] such that (x—C)?" = v4: 
note that we eliminated the denominator of a! That these favourable assumptions 
situation can be shown to occur follows from an important theorem of Francisco 
Thaine [35] (which also leads to a cyclotomic proof of the Main Conjecture of one 
dimensional Iwasawa Theory). 
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Continuing, one now finds a multiple y = )o jeg %oo € F,[G] of 6-0’ such that 
oeGg lo = hg for some h < po leading to the following equation 


vy =x(1—e/x)¥/4. 


The fact that v € R has the important advantage that the rapidly converging binomial 
series expansion of the expression 


(—¢/x** =|] @0-f/a)y4 (11) 


o€G 


will in fact converge to v/x" (rather than to some number that differs from v/x" by a 
qth root of unity, as would be true generally). With this, we obtain v = x’g(1/x) + 
F(1/x), with g € K[X] being a polynomial of degree h and F(T) € K[[T]] a power 
series. Finally, appealing to some lower bounds on A which were obtained by Hyyro, 
one can eventually show that, under the given arithmetic conditions, F(1/x) = 0. 
We thus have v = x"g(1/x), which leads to an arithmetic contradiction, completing 
the proof of Catalan’s Conjecture. 


2.3. The Fermat: Catalan Equation 


As previously mentioned, in the case that (x, y, z) is a non-trivial solution to the 
Fermat-Catalan equation (5) with odd exponents p and q, if we let a = — and 
21 = (a, z)—where e = 1 if p | z and e = 0 otherwise, then we have 27 = (a), a 
situation which is reminiscent of both the Fermat and the Catalan equations, and a 
starting point for cyclotomic investigations of (5). 

In this direction, the second author has tried to adapt the proof of Kummer’s 
Theorem to the case of Eq. (5). It would take too long to explain here the main points 
in which this equation differs from (3), necessitating the introduction of additional 
methods. Let us only mention that it is possible to restrict our attention to six cases, 
depending on whether or not p or q divide any of the factors of x- y-z- (x+y). 
After proving a generalization of Kummer’s descent to the p - gth cyclotomic field, 
it was often possible to either discard cases, or reduce them to conditions about 
the vanishing of some Fermat quotient—e.g. 27! = 1 mod q? or p?! mod q’. 
This approach succeeds in five cases. Unfortunately, in the sixth case, all classical 
Kummerian methods apparently fail. As a consequence, the second author was 
unable to find conditions on p and q which imply that (5) has no solutions. By 
symmetry, a set of such conditions could however be derived for the rational 
Catalan equation, i.e. the Eq. (6) in which x,y € Q is allowed. Note that, after 
clearing denominators, this equation is equivalent to 


XP 4 yP94 77=0, 
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which may be viewed as a “symmetrized Fermat—Catalan” equation. 

This first attempt to apply cyclotomic methods thus appears to confirm the 
somewhat pessimistic expectation that they are not sufficient for solving Eq. (5) in 
any generality. 


3 Fermat’s Last Theorem 


In the previous section, we discussed the cyclotomic approach to the Fermat 
equation and its potential limitations. We now sketch the approach of Hellegouarch, 
Frey, Serre and Ribet which culminated in Wiles’ proof of Fermat’s Last Theorem. 


Theorem 3 (Wiles [37]). The only integer solutions to the Fermat equation 
x" + y" — Zz 


with n = 3 satisfy xyz = 0. 


Recall that we call a solution trivial if xyz = 0, otherwise it is called non-trivial. 
Thus the theorem states that all solutions to the Fermat equation are trivial. As we 
have seen, the theorem is true for exponents n = 3 and 4. Thus it is sufficient to 
show, for primes p > 5, that all solutions to 


YP +y~p+/%=0 (12) 


are trivial. Of course, if (x,y,z) is a solution, we may by scaling suppose that 
gcd(x, y,z) = 1; we call such a solution primitive. The purpose of this section is 
to sketch the proof of Fermat’s Last Theorem and the ideas leading to it. The proof 
is based on three main pillars: 


(i) Mazur’s Theorem on irreducibility of Galois representations of elliptic curves; 
(ii) The modularity theorem, due to Wiles, Breuil, Conrad, Diamond and Taylor; 
(iii) Ribet’s level lowering theorem. 


Explaining these pillars will involve a detour into some of the most fascinating areas 
of modern number theory: elliptic curves, Galois representations, modular forms 
and modularity. 


3.1 Elliptic Curves 


There are many possible definitions of an elliptic curve. Let K be a field. An elliptic 
curve over K is a curve of genus | defined over K with a distinguished K-point. 
An alternative definition is: an elliptic curve over K is a 1-dimensional abelian 
variety over K. The simplest (though conceptually least enlightening) definition is: 
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an elliptic curve E over K is a smooth curve in P? given by an equation of the form 
Es ~etayet ay? =x + aye agxe’ + ag’, 


with a1, do, a3, a4 and ag € K. This in fact is a curve of genus 1, and the 
distinguished K-point is (x : y : z) = (0: 1 : 0). If the characteristic of K is 
not 2 or 3, then we can transform to a much simpler model given in A? by 


E: Y=X>+aX+b, (13) 
where a and b € K. We call this equation a Weierstrass model. Let 
A = —16(4a? + 27b’) 


which we call the discriminant of E (this is —16 times the discriminant of the 
polynomial on the right-hand side). The requirement that E' is smooth is equivalent 
to the assumption that A # 0. The distinguished K-point is now the ‘point at 
infinity’, which we denote by oo or @. Given a field L D> K, the set of L-points 
on E is given by 


E(L) ={@ ype? : y =P +axt+ dd U LG}. 


It turns out that the set E(L) has the structure of an abelian group with @ as the 
identity element. The group structure is easy to describe geometrically: three points 
P,, Pz, P3 € E(L) add up to the identity element if and only if there is a line @ 
defined over L meeting E in P;, Po, P3 (with multiplicities counted appropriately). 
The fact that E(L) is an abelian group (where the group operation has a geometric 
interpretation) ties in with the fact that F is an abelian variety. 


Theorem 4 (The Mordell—Weil Theorem). Let K be a number field and E an 
elliptic curve over K. Then E(K) is a finitely generated abelian group. 


When K is a number field we refer to the group E(K) as the Mordell-Weil group of 
E over K. 


Example J. As an example, consider the Fermat degree 3 equation over Q: 
vPty=2. (14) 


Viewed as a curve in P”, this is in fact a curve of genus 1. Let us choose the point 
(1 : —1 : 0) to be the distinguished point. We now transform this into a Weierstrass 
model using the transformation 


36(x — 12 
ya, See), X= 9 (15) 
x+y x+y 
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so that a solution to Eq. (14) corresponds to a rational point on the elliptic curve 
E: Y?=xX?— 432. 


The solution (1 : —1 : 0) to (14) corresponds to the point co = @ on E. The model 
E is the elliptic curve denoted by 27A in Cremona’s tables [8]. The group E(Q) has 
rank zero and, in fact, 


E(Q) = Z/3Z. 
Indeed, 
E(Q) = {@, (36,12), (36, —-12)} 
where in the group law on E we have 
2- (36, 12) = (36,—12) and 3- (36,12) =@. 


Thus the degree 3 Fermat equation (14) has exactly three solutions, and we may 
obtain these by taking the three points belonging to E(Q) and transferring them back 
to the model (14) using (15). Doing this, we find that the three solutions to (14) are 
(1:-—1:0), 1:0: 1) and (0: 1 : 0)—that is, just the trivial solutions. 


Example 2. One can similarly transform the equation 
y=xt+z (16) 
into the elliptic curve 
E: Y=xX?-4x 
which has Mordell—Weil group 
E(Q) = {@, (0,0), (2,0), (—2,0)} ~ Z/2Z x Z/2Z 


and use this information to deduce that the only solutions to (16) are the trivial ones. 


It turns out that the proofs by Fermat and Euler of the degree 4 and degree 3 cases of 
Fermat’s Last Theorem are simply special cases of what are now standard Mordell— 
Weil group computations. 

The degree p Fermat equation (12), viewed as defining a curve in P”, has genus 
(p—1)(p—2)/2, and thus does not define an elliptic curve for p > 5. We do mention 
in passing the following celebrated theorem of Faltings. 
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Theorem 5 (Faltings [13]). Let C be a curve of genus > 2 over a number field K. 
Then C(K) is finite. 


Faltings’ theorem tells us that for each p > 5 the Fermat equation (12) has finitely 
many primitive solutions. Faltings’ theorem is ineffective, in the sense that the proof 
does not yield an algorithm that is guaranteed to find all solutions. 


3.2 Modular Forms 


Let k and N be positive integers. We define 


Ty(W) = (3 ‘) € SL(Z) : c=0 (mod N)S. 


It is easy to see that [(N) is a subgroup of SL2(Z) of finite index. Let H be the 
complex upper-half plane 


H = {zeC : Im(z) > O}. 


The group I'9(N) acts on H via fractional linear transformations 


i 
(“*) Hea. so" 


cd czt+d- 


The quotient Yo(V) = Io(N)\H has the structure of a non-compact Riemann 
surface. This has a standard compactification denoted Xo(N) and the difference 
Xo(N) — Yo(N) is a finite set of points called the cusps. In fact the Riemann surface 
X(N) has the structure of an algebraic curve defined over Q and is an example of 
what is known as a modular curve. 

A modular form f of weight k and level N is a function f : H — C that satisfies 
the following conditions 


(i) f is holomorphic on H; 
(ii) f satisfies the property 


az+b\ _ k 
($=) = (cz + d)'f (2), (17) 


for all z € Hand (44) € To(N); 
(iii) f extends to a function that is holomorphic at the cusps. 


Observe that (||) € I'o(N). Thus by (ii), the function f satisfies f(z + 1) = f(z). 
Letting g(z) = exp(2ziz), we see, from the periodicity, that f must have a Fourier 
expansion 
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fo=> og 


n>No 


for some integer No. In fact, one of the cusps is the cusp at ioo which we can think 
of as being arbitrarily high up on the imaginary axis. Note that g(ioo) = 0. We see 
that for f to be holomorphic at the cusp ioo we require c, = 0 forn < 0. Thus we 
may write 


fo=> end". (18) 


n>0 


It turns out that the set of modular forms of weight k and level N, denoted by M;(N), 
is a finite-dimensional vector space over C. 

A cusp form of weight k and level N is a modular form f of weight k and level N 
that vanishes at all the cusps. As g(ioo) = 0 we see in particular that a cusp form 
must satisfy co = 0. The cusp forms naturally form a subspace of M;,(N) which we 
denote by S,(NV). Of particular interest are the weight 2 cusp forms of level N: these 
can be interpreted as regular differentials on the modular curves X(N). It follows 
that the dimension of S2(N) as a C-vector space is equal to the genus of the modular 
curve X(N). 

There is a natural family of commuting operators T, : S2(N) — S2(N) (with 
n > 1) called the Hecke operators. The eigenforms of level N are the weight 2 
cusp forms that are simultaneous eigenvectors for all the Hecke operators. Such an 
eigenform is called normalized if c; = | and thus its Fourier expansion has the form 


f=aqt So eng” 


n>1 


3.3 Modularity 


Let E be an elliptic curve over Q. Such an elliptic curve has a model of the form 
E: ytaxtay=x + agx” + asx + a6, (19) 


where the a; € Z, and a (non-zero) discriminant Ag which is an integer given by 
a complicated polynomial expression in terms of the aj. It is possible to change the 
model by carrying out a suitable linear substitution in x, y, and we generally work 
with a minimal model: that is one where the a; € Z with discriminant having the 
smallest possible absolute value. Associated to E is another, more subtle, invariant 
called the conductor Ng which we shall not define precisely, but we merely point 
that it is a positive integer sharing the same prime divisors as the discriminant; that 
it measures the “bad behavior’ of the elliptic curve E modulo primes; and that it can 
be computed easily through Tate’s algorithm [31, Chap. IV]. 
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Now let p + Ag be a prime. Then we can reduce the Eq. (19) to obtain an elliptic 
curve E over F,. The set E(,) i is an abelian group as before, but now necessarily 
finite, and we denote its order by #£ (F,,). Let 


a,(E) =p + 1—#E(F,). 


We are now ready to state a version of the modularity theorem due to Wiles, Breuil, 
Conrad, Diamond and Taylor [6, 37]. This remarkable theorem was previously 
known as the Taniyama—Shimura conjecture. 


Theorem 6 (The Modularity Theorem). Let E be an elliptic curve over Q with 
conductor N. There exists a normalized eigenform f = q + >~ cng" of weight 2 and 
level N such that cy € Z for all n, and if p + Ag is prime then cy = ap(E). 


In fact, for an eigenform f the Fourier coefficients are determined by the coefficients 
Cp with prime indices. Thus from the elliptic curve E we can construct the Fourier 
expansion of the corresponding eigenform f. What is astonishing is that f then 
satisfies the transformation properties (17). 


Example 3. We consider the following elliptic curve E over Q: 
E:y+y=x-x -10x—20. 


This has conductor 11, the smallest possible conductor, and discriminant —1 1°. The 
space S(11) is 1-dimensional. Naturally every non-zero element of $2(11) is an 
eigenform (i.e. an eigenvector for the Hecke operators), and we take as our basis the 
unique normalized eigenform which has the following Fourier expansion: 


f(2) =4q-2¢ —G +2q¢ +4 + 24° — 24’ --:- 


According to the modularity theorem the eigenform f corresponds to the elliptic 
curve E' and we may check a few of the coefficients to convince ourselves that this 
is the case. For example, 


EF.) = {6, (0,0), 0,1), 4,0), Cd, D}. 


It follows that a.(F) = 2+ 1 —#E(F2) = —2 agrees with the coefficient cp = 
—2 for q’ in the Fourier expansion for f. The reader can easily verify the relation 
dp(E) = Cp for the primes p = 3, 5 and 7. 


3.4 Galois Representations 


Let E be an elliptic curve over C. The structure of the abelian group E(C) is 
particularly easy to describe. There is a discrete lattice A C C of rank 2 (that is, as 
an abelian group A ~ Z7) depending on E, and an isomorphism 


E(C) ~ C/A. (20) 
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Let p be a prime. By the p-torsion of E(C) we mean the subgroup 
Elp] = {Q€ E(C) : pQ= 0}. 
It follows from (20) that 
Elp] ~ (Z/pZ)’. (21) 


This can be viewed as 2-dimensional F.,-vector space. 


Example 4. Let 
Bey ae +x. (22) 


It turns out that the corresponding lattice is A = Z + Zi. The p-torsion subgroup of 
C/A is 


at+bi 
p 


+A :a,b=0,...,.p—l). 


The reader will see that this a 2-dimensional F,-vector space with basis 1/p + A 
and i/p+ A. 


Now let E be an elliptic curve over Q. Then we may view E as an elliptic curve 
over C, and with the above definitions obtain an isomorphism E[p] ~ (Z/pZ)°. 
However, in this setting the points of E[p] have algebraic coordinates, and are 
acted on by Gg := Gal(Q/Q), the absolute Galois group of the rational numbers. 
Via the isomorphism (21), the group Gg acts on (Z/pZ)’. As noted, the latter is 
a 2-dimensional F,-vector space. We obtain a 2-dimensional representation that 
depends on the elliptic curve FE and the prime p: 


Bey : Go GLo(F,). (23) 


Example 5. We continue looking at the elliptic curve (22) but now regard it as an 
elliptic curve over Q. The 2-torsion subgroup is 


Recall that @ is the identity element. The three other elements of E[2] are points of 
order 2. Moreover, they satisfy the additional relation 


(0,0) + (,0) + (—i,0) = @. 
We can see this from the geometric description of the group law: the three points on 


the left-hand side are the intersection of the line y = 0 with E. As 2- (—i,0) = 0, 
we have that (—i, 0) = —(i, 0) and thus 
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(—i, 0) = (0,0) + @,0). 


We now see that E[2] is an F2-vector space with basis (0,0) and (i, 0). Let us now 
use this to write down fp; 5 explicitly. Let o € Gg. Then o(i) = ior o(i) = —i. 
Suppose first that o(i) = i. Then 


o(0,0) = (0(0),0(0)) = (0,0), a G0) = (0), (0) = (9). 


As o leaves our chosen basis fixed, we have 


Peale) = (57) € Gla. 


Suppose instead that o(7) = —i. Then 
a0 (0,0) = (a0 (0), 0(0)) = (0,0), o(i,0) = (oi), 0(0)) = (—i, 0) = (0,0)+(, 0). 


Thus the action of o with respect to our chosen basis is given by the matrix 


Pp2(o) = (; ) € GL; (F2). 


We record the image of the representation py 9: 


nsca=[(59)- QD) 


We note that the representation p, 5 is reducible, in the sense that all elements of the 


: ; 1 
image share a common eigenvector (; : 


We return to our general setting of an elliptic curve E over Q and a prime p. We 
say that the representation p, , is reducible if the matrices of the image py ,(Gg) 
share some common eigenvector. Otherwise we say that p; , 1s irreducible. 

We have now given enough definitions to be able to state Mazur’s theorem; this is 
often considered as historically the first step in the proof of Fermat’s Last Theorem. 


Theorem 7 (Mazur [24]). 


(i) Let E be an elliptic curve over Q and p > 163 be prime. Then pr, , is irreducible. 
(ii) Let E be an elliptic curve over Q with full 2-torsion (that is E[2| C E(Q)) and 
let p > 5 be prime. Then pr , is irreducible. 


It turns out that an elliptic curve E over Q such that p;,, is reducible corresponds 
to a rational point on the modular curve Xo(p) that is not a cusp. Mazur proved 
his theorem by determining the rational points on this infinite family of curves. 
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In a sense, Mazur’s theorem is not unlike Fermat’s Last Theorem, which is also a 
statement about the rational points on an infinite family of curves. 

We mention in passing the relationship between reducible mod p representations 
and isogenies. An isogeny of elliptic curves E, E’ is a non-constant map ¢ 
E — E' defined by algebraic equations that take the point at infinity on E to the 
point at infinity on E’. A non-trivial consequence of the Riemann—Roch theorem 
is that isogenies respect the group law, and so are in a sense algebro-geometric 
homomorphisms. A p-isogeny is an isogeny @¢ : E — E’ such that the kernel 
of ¢ has order p. Let E be defined over Q. Then the existence of an p-isogeny 
go : E — E’ defined over Q is equivalent to the representation Pr, being reducible. 
In fact, if Q is a non-zero element of the kernel of ¢, then Q is a non-zero eigenvector 
for all the elements of the image of fp; ,. We can restate (i) of Mazur’s theorem as 
saying that an elliptic curve E defined over Q has no p-isogenies for p > 163. 


3.5  Ribet’s Level Lowering Theorem 


Let E be an elliptic curve over Q. We saw above that, for each prime p, the curve 
E gives rise to a mod p Galois representation Pp, , : Gg — GL(F,). Let f be an 
eigenform. Deligne and Serre showed that such an f gives rise, for each prime p, 
to a Galois representation py, : Gg — GL» (Fr), where r > 1 depends on f. If 
E corresponds to f via the Modularity theorem (Theorem 6) then, unsurprisingly, 
Pep ~ Pr» (the two representations are isomorphic). Thus the representation p; , 
is modular in the sense that it arises from a modular eigenform. Recall that if f 
corresponds to E via modularity, then the conductor of E is equal to the level of f. 
Sometimes it is possible to replace f by another eigenform of smaller level which 
has the same mod p representation. This process is called level lowering. We now 
state a special case of Ribet’s level lowering theorem. Ribet’s theorem is in fact part 
of Serre’s modularity conjecture [29] that was proved by Khare and Wintenberger 
[20, 21]. 


Theorem 8 (Ribet’s Level Lowering Theorem [28]). Let E be an elliptic curve 
over Q with minimal discriminant A and conductor N. Let p = 3 be prime. 
Suppose 


(i) the curve E is modular; 
(ii) the mod p representation fp, , is irreducible. 


Let 


N, =N / Il ¢ (24) 
L\|N, 
p| orde(A) 


Then Ppp ~ Pep for some eigenform g of weight 2 and level Np. 
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Of course we now know, thanks to the Modularity theorem (Theorem 6) that 
all elliptic curves over Q are modular. Thus condition (i) in Ribet’s theorem is 
automatically satisfied. But we still include it for historical interest. 

We can make the relationship pz, ~ P,_ in Ribet’s theorem more explicit. Write 
&§ = 4+ >-,514nq" for the Fourier expansion of g. It turns out that the d, belong 
to the ring of integers Ox of a number field K that depends on g. The relationship 
Pep ~ Pep iS equivalent to the existence of a prime ideal 8 of Ox that divides pO x 
such that a,(E) = d, (mod §) for all primes q + Np. 


Example 6. Consider the elliptic curve 
E: yoxr-—x-71x+ 330 


with Cremona reference 132B1. Cremona’s database [8] gives us the minimal 
discriminant and conductor 


A=2*x 3" 11, N=2?x3xll. (25) 


The database also tells us that the only isogeny the curve E has is a 2-isogeny. Thus 
Px.» 18 irreducible for p > 3. We apply Ribet’s Theorem with p = 5. From the above 
recipe (24) for the level we find that N, = 44. It is possible to check that pz, ~ P,, 
where g is the following eigenform has weight 2 and level 44: 


g=4q4+q¢ —39 +2q' 29? —qit-. 


All the coefficients of g belong to Z. We tabulate a,(£) and the coefficients d, for 
primes qg < 50. The reader will note that the relationship a,(E) = d, (mod 5) holds 
for all primes g in the range except for g = 3 which does divide N. 


qd 2 3 5 7 |1l 413 /17 |19 |23 |29 |31 |37 | 41 43 | 47 
a (E) |0 |-1 2 2 fl 6 |-4 -2 |}8 0 0 |-6 0 10 0 
dy 0 1 1-3 2 |-l |-4 6 8 3 0 5 1 0 |-10 0 


3.6 The Proof of Fermat’s Last Theorem 


We are now able to sketch a proof of Fermat’s Last Theorem. Suppose p > 5 
is prime, and x, y and z are non-zero pairwise coprime integers such that 
x? + yP + zP = 0. We may reorder (x, y, z) so that y is even and x” = —1 (mod 4). 
We let E be the following elliptic curve which depends on the solution (x, y, z): 


BE: Y=X-(X=2)-(%+¥). (26) 
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The curve EF is called the Frey—Hellegouarch curve. The minimal discriminant and 
conductor of E are: 


2P y-P 2P 


a va N=[[¢. 


eA 


The choice 2 | y and x? = —1 (mod 4) ensures that 2 || N. 
We now consider p;,,,. The 2-torsion subgroup for E is 


E[2] = {@, (0,0), (x, 0), (—y’, 0)}. 


Note that E[2] C E(Q). As p => 5 we know by Mazur’s theorem (Theorem 7) 
that pz, is irreducible. Moreover E is modular by the Modularity theorem. 
The hypotheses of Ribet’s theorem are satisfied. We compute N, = 2 using the 
recipe in (24). It follows that pz, ~ P,, where g has weight 2 and level 2. A simple 
computation shows that there are no eigenforms of weight 2 and level 2. This 
contradiction completes the proof of Fermat’s Last Theorem. 


Some Historical Remarks In the early 1970s, Hellegouarch (e.g. [19]) had the 
idea of associating to a non-trivial solution of the Fermat equation the elliptic 
curve (26); he noted that the number field generated by its p-torsion subgroup 
E[p] has surprisingly little ramification. In the early 1980s, Frey [18] observed 
that this elliptic curve enjoys certain remarkable properties that should rule out 
its modularity. Motivated by this, in 1985 Serre [29] made precise his modularity 
conjecture and showed that it implies Fermat’s Last Theorem. Serre’s remarkable 
paper also uses several variants of the Frey—Hellegouarch curve to link modularity 
to other Diophantine problems. Ribet announced his level-lowering theorem 1987, 
thereby proving that modularity of the Frey—Hellegouarch curve implies Fermat’s 
Last Theorem. The proof of the Modularity theorem was completed around 1999 
by Breuil, Conrad, Diamond and Taylor [6]. A semistable elliptic curve is one 
with squarefree conductor. We note from (25) that the Frey—Hellegouarch curve is 
semistable. In 1994 Wiles [37], with some help from Taylor [33], proved modularity 
of semistable elliptic curves over Q, thereby proving Fermat’s Last Theorem. 


4 The (More) Generalized Fermat Equation 


We now return to the generalized Fermat equation 
we ee a (27) 
where x, y and z are integers, and the exponents p, q and r are (potentially distinct) 


positive integers. We restrict our attention to primitive solutions, i.e. those with 
gcd(x, y, z) = 1, since, without such a restriction, it is easy to concoct uninteresting 
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solutions in a fairly trivial fashion. Indeed, if we assume, say, that p,q and r are 
pairwise relatively prime, then we can choose integers u, v and w such that 


uqr = —1 (mod p), vpr = —1 (mod q) and wpg = —1 (mod r). 


If we are given any integers a, b and c witha + b = c, multiplying this equation by 
a?’ bP’c*P4I, we thus have that 


(aeart D/P per way? ae (a pePrtD/aewry? = (a@pr? cwpat D/r\" ; 


We call (p,q, 1r) the signature of Eq. (27). The behaviour of primitive solutions 
depends fundamentally upon the size of the quantity 


1 1 1 
OD.) Sot ete; 
Pq ¢ 
in particular, whether o(p,g,r) > 1, o(p,q,r) = 1 or o(p,q,r) < 1. If we set 
X = o(p,q,r) — 1, then x is the Euler characteristic of a certain stack associated 
to Eq. (27). It is for this reason that the cases o(p,q,r) > 1, o(p,g,r) = 1 and 


o(p,q,r) < 1 are respectively termed spherical, parabolic and hyperbolic. 


4.1 The Spherical Case o(p,q,r) > 1 


In this case, we may assume that (p,q,r) is one of (2,2,r), (2,q,2), (2,3, 3), 
(2,3,4), (2,4,3) or (2,3,5). In each of these situations, the (infinitely many) 
relatively prime integer solutions to (27) come in finitely many two parameter 
families (the canonical model to bear in mind here is that of Pythagorean triples); in 
the (most complicated) (2, 3, 5) case, there are precisely 27 such families, as proved 
by Johnny Edwards [11] in 2004 via an elegant application of classical invariant 
theory. In the case (p, g, r) = (2,4, 3), by way of example, we find that the relatively 
prime solutions x, y and z satisfy one of the following four parametrizations: 


x = Ats(s? — 3#2)(s4 + 622s? + 8124) (354 + 2775? + 32°), 
y = £(s* + 32) (s* — 188s? + 974), 
z= (st — 2f?s* + 9¢4)(s* + 302s? + 924), 


where s ¥ t (mod 2) and 3 + s, 
x = +£(4s* + 324)(16s® — 408¢4s* + 925), 


y = 6ts(4s* — 3r4), 
z= 16s + 1687454 + 928, 
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where ¢ is odd and 3 } s, 


x = +(5¢ + 1224)(s® — 408245* + 14428), 
y = 6ts(s* — 1274), 
z= s* + 168¢4s* + 14428, 


where s = +1 (mod 6), or 


x = 2(s* + 2ts? + 6125? + 285 + £*)(235° — 16ts? — 1722756 — 1128 5° 
—22¢4s* — 1124s? — 1729s? — 16275 + 232), 

y = 3(s —2t)(s + t)(s* + 84s? + 60752 + 85 4+ 24), 

z= 1358 + 160s’? + 28725 + 1127 5° + 2381454 
+1126 s3 + 284s? + 16t7s + 1328, 


where s ¥ t (mod 2) and s ¥ t (mod 3). Here, s and t are relatively prime integers. 
Details on these parametrizations (and much more besides) can be found in Cohen’s 
exhaustive work [7]. 


4.2 The Parabolic Case o(p,q,r) = 1 


If we have o(p,q,r) = 1, then, up to reordering, 
(p,9.r) = (2,6, 3), (2, 4, 4), (4, 4, 2), (3, 3, 3) or (2, 3, 6). 


As in Examples | and 2, each equation now corresponds to an elliptic curve of 
rank 0 over Q; the only primitive non-trivial solution comes from the signature 
(p,q.r) = (2,3, 6), corresponding to the Catalan solution 37 — 2? = 1. 


4.3 The Hyperbolic Case o(p,q,r) <1 


It is the hyperbolic case, with o(p,q,r) < 1, where most of our interest lies. Here, 
we are now once again considering the equation and hypotheses (4). As mentioned 
previously, it is expected that the only solutions to (4) are with (x,y, z,p,q,r) 
corresponding to the identity 1” + 23 = 37, for p > 6, or to 


GB+P=34, 7P+132?=29, 274+17=712, 35+ 114 = 1222, 
177 + 762713 = 210639287, 1414? + 2213459? = 657, 92623 + 15312283? = 1137, 
438 + 962223 = 300429072 and 338 + 1549034? = 15613°. 


The Generalized Fermat Equation 197 


A less ambitious conjecture would be that (4) has at most finitely many solutions 
(where we agree to count those coming from 1” + 2? = 3? only once). In the rest of 
this section, we will discuss our current knowledge about this equation. 


4.4 The Theorem of Darmon and Granville 


What we know for sure in the hyperbolic case, is that, for a fixed signature (p, q, r), 
the number of solutions to Eq. (4) is at most finite: 


Theorem 9 (Darmon and Granville [9]). /f A,B, C, p,q and r are fixed positive 
integers with 


then the equation 
Ax? + By? = Cz" 


has at most finitely many solutions in coprime non-zero integers x,y and z. 


Proof. The proof by Darmon and Granville is extremely elegant and we cannot 
resist giving a brief sketch. The hypothesis 1/p + 1/q + 1/r < 1 is used to show 
the existence of a cover@ : D—> P! that is ramified only above 0, 1, ~, 
where the curve D has genus > 2. Moreover, this cover has the property that the 
ramification degrees above 0 are all divisors of p, above 1 are all divisors of g, and 
above oo are all divisors of r. Now let (x, y, z) be a non-trivial primitive solution to 
the equation Ax’ + By? = Cz’. The above properties of the cover ¢ imply that 
the points belonging to the fiber @~!(Ax’/Cz") are defined over a number field 
K that is unramified away from the primes dividing 2ABCpgqr. It follows from a 
classical theorem of Hermite that there are only finitely many such number fields 
K. Moreover, by Faltings’ theorem, for each possible K there are only finitely many 
K-points on D. It follows that the equation Ax? + By? = Cz" has only finitely many 
primitive solutions. 


It is worth noting that the argument used in the proof is ineffective, due to its 
dependence upon Faltings’ theorem; it is not currently known whether or not there 
exists an algorithm for finding all rational points on an arbitrary curve of genus > 2. 
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4.5 A Brief Survey of What We Know 


What we would really like to do goes rather further than what the theorem of 
Darmon and Granville tells us. Indeed, we would like to obtain finiteness results for 
Eq. (4) where we allow the exponent triples (p, g, r) to range over infinite families. 
In the following tables, we list all known (as of 2015) instances where Eq. (4) has 
been completely solved. For references to the original papers we recommend the 
exhaustive survey [4]. The first table collects all known infinite families treated to 


date: 
(p.4,7) Reference(s) 
(n, n,n) Wiles, Taylor-Wiles 
(n,n, k), k € {2,3} Darmon-Merel, Poonen 
(2n, 2n, 5) Bennett 
(2,4,n) Ellenberg, Bennett-Ellenberg-Ng, Bruin 
(2, 6,n) Bennett-Chen, Bruin 
(2,n, 4) Bennett-Skinner, Bruin 
(2,n, 6) Bennett-Chen-Dahmen- Yazdani 
(37, 3k,n), j,k > 2 Immediate from Kraus 
(3, 3, 2n) Bennett-Chen-Dahmen- Yazdani 
(3, 6, n) Bennett-Chen-Dahmen- Yazdani 
(2, 2n,k), k € {9, 10, 15} Bennett-Chen-Dahmen- Yazdani 
(4, 2n, 3) Bennett-Chen-Dahmen- Yazdani 


(2j, 2k, n), j,k = Sprime,n € {3,5,7, 11, 13} 


Anni-Siksek 


Our second table lists “sporadic” triples where the solutions to (4) have been 
determined, and infinite families of exponent triples where the (p,q,r) satisfy 


certain additional local conditions. 


(p,q,r) Reference(s) 

(3, 3,n)* Chen-Siksek, Kraus, Bruin, Dahmen 
(2, 2n, 3)* Chen, Dahmen, Siksek 

(2, 2n, 5)* Chen 

(2m, 2n, 3)* Bennett-Chen-Dahmen- Yazdani 

(2, 4n, 3)* Bennett-Chen-Dahmen- Yazdani 

(3, 3n, 2)* Bennett-Chen-Dahmen- Yazdani 
(2,3,n),n€ Poonen-Schaefer-Stoll, Bruin, Zureick-Brown, Siksek, Siksek-Stoll 
{6, 7,8, 9, 10, 15} 

(3,4, 5) Siksek-Stoll 

(5,5,7), (7,7,5) | Dahmen-Siksek 
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The asterisk here refers to conditional results. For instance, in case (p,q,r) = 
(3,3,n), we have no solutions if either 3 < n < 10°, orm = +2 modulo 5, or 
n = +17 modulo 78, or 


n = 51, 103, 105 (modulo 106), 
or for n (modulo 1296) one of 


43, 49, 61,79, 97, 151, 157, 169, 187, 205, 259, 265, 277, 295, 313, 367, 373, 385, 403, 
421,475, 481, 493, 511, 529, 583, 601, 619, 637, 691, 697, 709, 727, 745, 799, 805, 
817, 835, 853, 907, 913, 925, 943, 961, 1015, 1021, 1033, 1051, 1069, 1123, 
1129, 1141, 1159, 1177, 1231, 1237, 1249, 1267, 1285. 


The results mentioned here have been proved by essentially two distinct methods. 
For a number of fixed triples, the problem has been reduced (via arguments similar 
to those of Darmon and Granville, or otherwise) to one of determining Q-rational 
points on certain curves of genus 2 or higher. These points were subsequently found 
via Chabauty-type methods and appeal to a version of the Mordell-Weil sieve. In 
each case where Eq. (4) has been solved for an infinite family of triples (p,q, 1), 
however, a different approach has been utilized, relying upon Frey—Hellegouarch 
curves and connections between them and modular forms. 


5 The Modular Approach and the Generalized 
Fermat Equation 


It is natural to ask if the proof of Fermat’s Last Theorem can be adapted to 
resolve (4), at least for certain signatures (p,q,r). Roughly speaking a Frey— 
Hellegouarch curve is an elliptic curve E over Q, attached to a solution of a 
Diophantine equation satisfying two conditions: 


(i) the discriminant of E has the form A- B? where A is a known (small) integer and 
p is a prime; 
(ii) every prime q | B divides the conductor exactly once. 


Examining the recipe (24) in Ribet’s theorem the reader will note that the level N, 
depends only on the known quantity A. For example, in the proof of Fermat’s Last 
Theorem, A is a power of 2 and the level N, = 2. 

Alas, only a few signatures have workable Frey—Hellegouarch curves. In the 
following table we record some of the known ones. 

These and similar Frey—Hellegouarch curves have been used to prove many of 
the results surveyed in Sect. 4.5. 
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Equation Frey—Hellegouarch Curve 

a? + bP =? | ¥? = X3 + 2cX? + aX 

a? + bP =e? | ¥? = X3 + 3cX? — 4b? 

O+b=c? | ¥2 = X3 + 3(a—b)X? +3(a@ —ab+ b?)X 
e+bh=c? | ¥? = X34 3bxX+2a 


5.1 A Sample Signature: (p,p, 2) 


To illustrate the approach we look specifically at the equation x”+y’ = z* where p > 
7 is prime. Here we follow the paper of Darmon and Merel [10] who showed that 
the only primitive solutions are (+1, 1,0), (1,0, +1), (0,1, +1). Let (@, y,z) = 
(a,b,c) be a primitive solution satisfying ab # 0. As in the preceding table, we 
associate to this the Frey—Hellegouarch curve 


E : ¥Y* = X34 2cx? + aX. 


This is modular by Theorem 6. By a variant of Mazur’s theorem (Theorem 7) the 
mod p representation fp, ,, is irreducible. Now an application of the Ribet’s theorem 
shows that pr, ~ P,, Where g is an eigenform of weight 2 and level 32. This is 
where we diverge from the proof of Fermat’s Last Theorem: there is an eigenform 
of weight 2 and level 32. It turns however that this eigenform is rather special as 
it corresponds to an elliptic curve with complex multiplication. It follows from this 
fact that p,,, : Gg — GL2(F;,) is not surjective. To complete the resolution of the 
equation x” + y? = z?, Darmon and Merel needed to show that if ab # —1 then Prep 
is in fact surjective and hence cannot be isomorphic to p, ,,. To do this, they showed 
that if P; , is not surjective then it gives rise to a rational point on one of a family of 
certain modular curves, and completed the proof by determining the rational points 
on this family. This last step is somewhat similar to the proof of Mazur’s theorem. 


6 Modularity over Number Fields 


Even when we are interested in Diophantine equations in rational integer unknowns, 
factorization arguments often force us to consider Diophantine equations where the 
coefficients or unknowns lie in a number field. Consider for example the equation 


a+h=c, gcd(a, b,c) = 1 (28) 


where the exponent p is prime. This equation is not known to have a Frey— 
Hellegouarch curve defined over Q. We can, however, factor the left-hand side as 
(a? + bi)(a? — bi) where i = //—1. It is not hard to show using the arithmetic of the 
Gaussian ring Z[i] that 
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at+bi=o, a —bi=a’. 
where a € Z[i] and @ is its conjugate. Eliminating b we obtain the equation 
a? + @? = 2a’. 


This is an (p,p,2) equation. However, unlike the equations we met in Sect. 5.1, 
some of the unknowns belong to Z|]. We ignore this uncomfortable fact for now and 
simply imitate the approach in the previous section to associate a Frey—Hellegouarch 
curve to this equation. The Frey—Hellegouarch curve is 


E: y = x + 2ax* + 2a? x (29) 


which has discriminant 2°(w@)’. We note that the discriminant is close to being 
a perfect pth power. To solve our original generalized Fermat equation (28) with 
signature (4,2, p) and unknowns belonging to Z, we need to consider an elliptic 
curve that is defined over Q(i). It is natural to ask how much of modularity and 
level lowering carry over to the setting of number fields. If we ask these questions 
for elliptic curves over general number fields then the answers are conjectural with 
almost no satisfactory theorems. However there are two situations where there are 
satisfactory theorems and these have been applied to certain generalized Fermat 
equations: Q-curves and elliptic curves over totally real fields. 


6.1 Q-Curves 


A Q-curve is an elliptic curve E over a number field K that is isogenous to all its 
conjugates. The Frey curve (29) is an example of a Q-curve: it is defined over the 
number field Q(i) and it happens to be isogenous to its conjugate y* = x* + 2ax” + 
2@?x (the conjugate is obtained simply by conjugating the coefficients of the elliptic 
curve). 

A consequence of the proof of Khare and Wintenberger of Serre’s modularity 
conjecture is that Q-curves are modular. The modularity of the Q-curve E given 
by (29) was used by Ellenberg [12] and by the first author, Ellenberg and Ng [5] to 
completely solve a+ + b* = c? showing in fact that there are no non-trivial primitive 
solutions with n > 4 (here n does not have to be prime). The first author and Chen 
[3] have used modularity of Frey—Hellegouarch Q-curves to show that the equation 
a’ + b® = c" has no non-trivial solutions with gcd(a, b,c) = 1 andn > 3. 


202 M. Bennett et al. 
6.2 Elliptic Curves over Totally Real Fields 


A number field K of degree n has n embeddings into the complex numbers 1; 

K — Cwithj = 1,...,n. For example if K = Q(6) is a number field of degree 
n, then 6 is the root of an irreducible degree n polynomial with rational coefficients. 
Such a polynomial has n distinct complex roots 6;,...,8,, and the embedding 1; 
satisfies 1;(0) = 6;. The embedding 1; is real if 1j(K) C R. Equivalently if 6; € R. 
If all the embeddings are real then we say that K is a totally real (number) field. An 
example of a totally real field is the cubic field K = Q(@) where 0 = & 4+ €5'. 
Here @ is a root of the polynomial x* + x* — 2x — 1. The roots 6; of the polynomial 
are 2 cos(27j/7) with j = 1, 2, 3 which are all real. 

Elliptic curves over totally real fields are conjecturally expected to be modular 
in the sense that they correspond to what are known as Hilbert modular forms 
(the classical modular forms of Sect.3.2 are a special case of Hilbert modular 
forms). There has been substantial progress towards proving modularity for elliptic 
curves over totally real fields thanks to the work of Barnet-Lamb, Breuil, Diamond, 
Gee, Geraghty, Kisin, Skinner, Wiles and many others (many of these results in 
fact integrate level lowering with modularity). Building on this work, modularity 
of elliptic curves over real quadratic fields was recently proved by Freitas, Le 
Hung and Siksek [14]. To solve Diophantine problems via Frey curves that are 
defined over totally real fields one needs not only modularity and level lowering, 
but also irreducibility theorems for mod p representations of elliptic curves. Over 
the rationals, as we saw in Sect. 3.4, this is provided by Mazur’s theorem. No such 
theorem is known over any number field other than Q. However Frey curves are 
almost semistable and this fact can usually be used [17] together with the celebrated 
uniform boundedness theorem of Merel [25] to supply the required irreducibility 
result. 

As an example we mention the equation 


a+b" =e, gcd(a, b,c) = 1. (30) 


studied by Anni and Siksek [1]. Here 2, m > 5 and p > 3 are primes. A complicated 
factorization argument is used to reduce this to a Fermat equation with signature 
(£,£,£) with coefficients and unknowns belonging to the totally real field Q(¢, + 
t e '). The corresponding Frey curves over this field are shown to be modular using 
the above-mentioned works for p = 3, 5, 7, 11 and 13. This is then used to show 


that the only solutions to (30) are the trivial ones. 


7 A Way Forward: Darmon’s Program 


The Frey—Hellegouarch approach used in the proof of Fermat’s Last Theorem and 
in the resolution of many other equations (as sketched in previous sections) attaches 
an elliptic curve to a hypothetical solution of the equation in question and then uses 
modularity to make deductions about this solution. It is natural to ask: 
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(i) Are there geometric objects other than elliptic curves that are somehow 
modular? 

(ii) If so, can these be used as an alternative, perhaps to add flexibility and tackle 
generalized Fermat equations for which no Frey—Hellegouarch elliptic curve is 
known? 


An abelian variety is a connected and projective algebraic group. Roughly 
speaking this means that it is defined by algebraic equations in projective space 
and carries a group structure (that happens to be abelian). An abelian variety has a 
dimension d > 1, and abelian varieties of dimension | are simply elliptic curves. 
Are abelian varieties over Q modular? The answer should be ‘yes’, except that the 
precise meaning of word modular in this generality is not yet resolved. 

An abelian variety of dimension d is said to be of GL2-type if its endomorphism 
ring is an order in a number field of degree d. A Theorem of Khare and Wintenberger 
[20] states that abelian varieties of GL2-type over Q are modular in a very precise 
sense (that is in fact very close to that of elliptic curves in Sect.3.3). Abelian 
varieties of GL2-type over totally real fields are expected to be modular in the sense 
that they correspond to Hilbert modular forms. Darmon exploits this idea to attack 
the equation x” + y? = z”, where p and r are primes and gced(x, y, z) = 1 as usual. He 
attaches a hypothetical non-trivial solution to an abelian variety of GL2-type over 
the totally real field Q(, + €~'). Using this he proves several beautiful theorems 
about possible solutions, though all are dependent on yet unproven conjectures. 
The biggest obstruction to Darmon’s program is the absence of a Mazur-style 
irreducibility theorem for mod p representations of abelian varieties of GL2-type. 

The Darmon program holds the greatest promise for further substantial progress 
on the generalized Fermat equation. Just as Frey’s original work was the spark that 
led to the formulation of Serre’s modularity conjecture, and the proofs of Ribet’s 
theorem and the Modularity theorem, so we hope that Darmon’s program will 
supply the impetus for new theorems for abelian varieties of GL2-type that in turn 
allow us to make deductions about the generalized Fermat equation. 
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The Conjecture of Birch and Swinnerton-Dyer 


John Coates 


Abstract The conjecture of Birch and Swinnerton-Dyer is one of the principal 
open problems of number theory today. Since it involves exact formulae rather than 
asymptotic questions, it has been tested numerically more extensively than any other 
conjecture in the history of number theory, and the numerical results obtained have 
always been in perfect accord with every aspect of the conjecture. The present article 
is aimed at the non-expert, and gives a brief account of the history of the conjecture, 
its precise formulation, and the partial results obtained so far in support of it. 


1 History 


The written history of the arithmetic of elliptic curves can be traced back at least to 
Arab manuscripts of over 1000 years ago, which were concerned with the problem 
of finding which positive integers are the areas of right-angled triangles, all of 
whose sides have rational length (traditionally, such positive integers are called 
congruent numbers). For example, 5 is a congruent number because it is the area of 
a right-angled triangle, whose sides have lengths 9/6, 40/6, 41/6. In fact no smaller 
congruent number was discovered by the ancients. It is easily seen that a positive 
integer N is a congruent number if and only if there is a point (x,y), with x and y 
rational numbers, and y ¥ 0, on the curve 


(1.1) y =x —N’x. 


In the seventeenth century, Fermat gave the first proof that | is not a congruent 
number, by introducing his method of infinite descent, and carrying it out on the 
curve (1.1) with N = 1. Fermat also noted that an intermediate step in his proof 
showed that, when n = 4, the curve x” + y” = z” has no solution in integers x, y, z 
which are all non-zero, and presumably this is what led him to claim that the same 
assertion holds for all n > 3. More generally, by an elliptic curve over a field F’, we 
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mean an irreducible non-singular projective algebraic curve of genus | defined over 
F, which is endowed with a given F-rational point O. Any such curve has a plane 
cubic model of the form 


(1.2) ytaxytay=x A gia? + aa + tie (a; € F), 


where O is now taken as the unique point at infinity (see, for example, [47]). Such an 
elliptic curve EF is an abelian variety of dimension |, meaning that the set of all points 
on such a curve with coordinates in some fixed extension field of F has a natural 
algebraic abelian group structure, with O as the zero element. In 1922, Mordell 
beautifully generalised Fermat’s infinite descent argument and proved that the group 
of rational points on every elliptic curve defined over Q is always finitely generated 
as an abelian group. However, the big mystery left open by Mordell’s proof was 
whether or not the procedure of infinite descent always terminated in a finite number 
of steps, thus enabling one to actually determine the group of rational points on the 
curve. In practice, this always seems to be the case, but, in fact, it has never been 
proven theoretically. The villain of the piece is a mysterious group, subsequently 
called the Tate-Shafarevich group of the elliptic curve, which is defined by 


IIM(E/Q) = Ker(H'(Q, E) > | [H'@Q..5), 


where v runs over all places of Q, and Q, is the completion of Q at v. This torsion 
abelian group is always conjectured to be finite, but today we can still only prove 
this under a very restrictive hypothesis discussed below. 

The discoveries of Birch and Swinnerton-Dyer came as a great surprise to the 
mathematical world when they first became public around 1962. Starting in the 
autumn of 1958, they had carried out a series of brilliantly planned numerical 
experiments on the early EDSAC computers in Cambridge, whose aim was to 
uncover numerical evidence for the existence of some kind of analogue for elliptic 
curves of the mysterious exact analytic formulae proven by Dirichlet for the class 
numbers of binary quadratic forms, and powerfully extended to all quadratic forms 
by Siegel. Even though Siegel’s work had been actively developed further for linear 
algebraic groups around this time by Kneser, Tamagawa, Weil, and others, it was 
Birch and Swinnerton-Dyer alone who first sought, and later found evidence for, an 
analogue for elliptic curves. It surely is one of the great mysteries of number theory, 
first uncovered by Birch and Swinnerton-Dyer, that purely arithmetic questions 
about the determination of E(Q) and IlI(E/Q) for an elliptic curve E over Q seem 
to be inextricably involved with the behaviour of the complex L-function of E. 

In this survey article, we shall mainly discuss the conjecture of Birch and 
Swinnerton-Dyer in the most important and down to earth case of elliptic curves 
defined over Q. However, the conjecture extends without difficulty to abelian 
varieties of arbitrary dimension defined over either a finite extension of Q, or over 
a function field in one variable over a finite field (see [51]). To date, very little has 
been proven about the conjecture for general abelian varieties of dimension > 1 
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over number fields. However, for abelian varieties defined over a function field in 
one variable over a finite field, the remarkable work of Artin and Tate [51] makes 
great progress on the conjecture, apart from the mysterious question of the finiteness 
of the analogue of the Tate-Shafarevich group. 


2 L-Functions 


Let E be any elliptic curve defined over Q. By a global minimal Weierstrass equation 
for E, we mean any equation for E of the form (1.2), whose coefficients a; are all 
integers, and whose discriminant A is as small as possible in absolute value (for the 
definition of A, and other facts about the elementary geometry of elliptic curves see 
[47]). Such equations are not unique, but we fix any one of them for the discussion 
which follows. Like all the L-series of arithmetic geometry, the complex L-series 
of E is defined by an Euler product. For each prime number p, define N, by letting 
N, — 1 denote the number of solutions of the congruence 


yr + ayxy+ ay = x? + ax + asx + ag mod p, 
and then put 
tb =pt+1—N,. 


If (p, A) = 1, we have |t,| < 2.,/p by Hasse’s theorem. If p divides A, then ¢, = 1 
if E has multiplicative reduction at p with tangents at the node defined over F,,, 
t) = —1 if E has multiplicative reduction at p with tangents at the node not defined 
over F,,, and t, = 0 when E had additive reduction at p. The complex L-series of E 
is then defined by the Euler product 


(2.1) L(E,s) = []Q-s")" I (pp tp)". 


pP\|A (p,A)=1 


This Euler product defines a Dirichlet series 


[o,@) 
LE ss ) on, 
1 


n= 


where c, = ft, for every prime p, and which converges in the half plane Re(s) > 3. 
When Birch and Swinnerton-Dyer first began their calculations, it was only known 
how to analytically continue this function to the entire complex plane when EF has 
complex multiplication (i.e. the ring of endomorphisms of E, which are defined over 
C, is strictly bigger than Z), using ideas about Eisenstein series which go back to 
Eisenstein and Kronecker, and which were subsequently developed systematically 
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by Deuring [17]. To prove the analytic continuation for all E, we need the following 
fundamental result, the essential idea behind the proof of which we owe to Wiles 
[57] (see also [5]). The conductor C(E) of E is the positive integer defined by 


C(E) = |p". 


piA 


where f, = 1 if E has multiplicative reduction at p, and f, = 2 + 4, for some integer 
6, = 0 if E has additive reduction at p. Moreover, in this latter case, 5, = 0 when 


p = 5. Let [o(C(E)) be the subgroup of SL2(Z) consisting of all matrices (< ) 
c 


27 iT 


with c = 0 mod C(E£). Let H be the complex upper half plane, and put q = e 
with t € H. Define 


oe) 


Se(t) = > Cnq’'. 


n=1 


Theorem 2.1. The holomorphic function fg(t) is a primitive cusp form of weight 2 
for To(C(E)). 


By a generalization of classical ideas of Hecke, this theorem not only proves that 
L(E, s) can be extended to an entire holomorphic function of s, but it also establishes 
the following functional equation. Define 


A(E, s) = C(E)*/?(2) “TT (s)L(E, s). 


Corollary 2.2. The function A(E, s) can be extended to an entire function of s, and 
satisfies the functional equation 


(2.2) A(E, s) = weA(E,2—-s), 


where we = +1. 


The so called root number wz = +1 is important for us because we see immediately 
from (2.2) that L(E, s) has a zero at s = | of even or odd multiplicity, according as 
We = +1 or we = —1. Moreover, the theory of L-functions shows that wr can 
always be calculated as a product of purely local factors. For example, if E is taken 
to be the curve (1.1) with N a square free positive integer, then we = +1 when 
N = 1, 2, 3 mod 8, and wg = —1 when N = 5, 6, 7 mod 8, whence, in particular, 
L(E, s) always has a zero at s = 1 whenever N = 5, 6, 7 mod8. 

We mention that one can, more generally, consider elliptic curves E which are 
defined over some finite extension F of Q. Again the group of F-rational points 
on F is a finitely generated abelian group, and again no algorithm has ever been 
proven for infallibly determining this group, again thanks to our lack of knowledge 
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of the finiteness of the Tate-Shafarevich group of such a curve. Of course, these 
elliptic curves also have a complex L-series, which we now denote by L(E/F,s), 
which is defined in the region Re(s) > 3/2 by an entirely analogous Euler product 
to (2.1), but taken over all finite places of the field F. When E admits complex 
multiplication, the analytic continuation and functional equation of L(E/F, s) follow 
immediately from Deuring’s theorem [17], which identifies L(E/F, s) with a product 
of Hecke L-series with Grossencharacters. However, when E does not have complex 
multiplication, our knowledge of the analytic continuation of L(E/F, s) is still very 
limited, with the most striking results established so far being proofs of this assertion 
either when F is a real quadratic field [19], or when F is any finite extension of 
Q which is contained in the cyclotomic Z,-extension of Q for any prime number 
p [53]. 


3 The Birch-Swinnerton-Dyer Conjecture 


We will now state the conjecture of Birch and Swinnerton-Dyer in both its weak and 
strong form, and discuss the evidence for it in subsequent sections. The conjecture, 
which was first published in [3], predicts a remarkable link between the arithmetic 
of an elliptic curve E defined over Q, and the behaviour of its complex L-series 
L(E, s) at the point s = 1. Let gz denote the rank of E(Q) (.e. the Q-dimension of 
E(Q) @z Q). We define 


Definition 3.1. rg is the order of the zero of L(E,s) at s = 1. 


Weak Birch-Swinnerton-Dyer Conjecture. For all elliptic curves E over Q, we 
have 


The full Birch-Swinnerton-Dyer conjecture is the weak Birch-Swinnerton-Dyer 
conjecture, together with a purely arithmetic exact formula for the constant L¢ 
defined by 


(3.2) £p = lim L(E, s)/(s— 1)". 


This formula involves the following arithmetic invariants. Firstly, there is a regulator 
term coming from the Neron-Tate height. If @ = m/n, with m and n relatively prime 
integers, is any non-zero rational number, we define its height h(a) by h(a) = 
log(max(|m|, |n|)), and put (0) = 0. Then Neron and Tate proved independently 
(see [47], Chap. 8) that there is a unique function 


h:E(Q)—>R 
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such that h(O) = 0, and, as P runs over the non-zero points in E(Q), we have 
h(2P) = 4h(P) and |A(P) — h(x(P))| is bounded independent of P, where x(P) 
denotes the x-coordinate of P in any fixed generalised Weierstrass equation (1.2). 
Then the function on E(Q) x E(Q) defined by 


Definition 3.2. (P,Q) = 5 (nr ® Q) —h(P) — iO) 


is biadditive. Moreover, we have h(P) = 0 if and only if P is a torsion point in 
E(Q). If one uses, in addition, the fact that there are only finitely many points P in 
E(Q) with h(P) < c for any constant c > 0, it follows, as was first remarked by 
Cassels, that h induces a positive definite quadratic form on E(Q) ®z R. Hence, for 
any choice of gz independent points P;,..., P,, in E(Q), we always have that the 
determinant det(P;, P;) is strictly positive. We then define Roo(E) = 1/#(E(Q)”) if 
ge = Oand 


Definition 3.3. Roo(E) = det(P;, P;)/[EQ) : 2, ZPi}* if gz > 0. 


We assume that we have fixed any global minimal Weierstrass equation for EF, and 
we define Qz to be the least positive real period of the Neron differential on E, 
which is given by 


dx 
o = ————_. 
2y + ax + a3 


The next subtle ingredient in the conjectural exact formula for fg are the so 
called Tamagawa factors, which are purely local terms occurring for the prime at 
infinity, and the finite primes q dividing the conductor C(E) of E. 


Definition 3.4. coo(E) is equal to 1 or 2, according as the group of points E(R) on 
E with real coordinates has | or 2 connected components. 


Next assume that q is any prime number dividing the conductor C(E). Let Q, be the 
completion of Q at g. Now, since q is a prime of bad reduction for E, the reduction 
of E modulo q will be a cubic curve with a singular point, and we define Ey(Q,) 
to be the subgroup of E(Q,) consisting of all points whose reduction modulo g is 
non-singular. Since we are working with a generalised Weierstrass equation which 
is minimal at g, the index of Eo(Q,) in E(Q,) will be independent of the choice of 
the Weierstrass equation. 


Definition 3.5. For a prime g of bad reduction, cg(E) = [E(Q,) : Eo(Q,)]. 


In general, there is no simple formula for c,(E), but Tate [50] gave an explicit 
algorithm for computing c,(£) from any generalised Weierstrass equation for E 
which is minimal at qg, and also proved:- 


Lemma 3.6. /fE has split multiplicative reduction at q, then cq(E) = ord,(A). For 
all other primes q of bad reduction, cq(E) < 4. 


We can now at last state the full Birch-Swinnerton-Dyer conjecture. 
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Full Birch-Swinnerton-Dyer Conjecture. We have rg = gr. Moreover, UI(E/Q) 
is finite, and the following exact formula is valid 


E 


@.3) SE = #(T1(E/Q))Reo(E)coo(E) [] ea(E). 


Q -—4 
B alC(E) 


Note that elliptic curves which are Q-isogenous have the same L-functions. 
However, it is not at all obvious that the exact formula (3.3) being valid for an 
elliptic curve implies that it is valid for any isogenous curve, but this was proven by 
Cassels [8] and Tate [51]. 

We shall spend most of the rest of this article discussing the fragmentary 
theoretical results proven so far, in the direction of both the weak and full Birch- 
Swinnerton-Dyer conjecture. However, to illustrate immediately the limits of our 
present knowledge, let us note three simple consequences of the conjecture which 
have never been established for a single elliptic curve E over Q. Firstly, it has never 
been proven that there exists an elliptic curve E defined over Q with rg > 4, even 
though there are many examples of such E with gz > 4. Secondly, it has never been 
proven that III(£/Q) is finite for a single elliptic curve E with rg > 2. Thirdly, 
it has never been proven that £¢/(QeRoo(E)) is a rational number for a single E 
with rg = 2. 

The earliest numerical work in support of these conjectures is given in the papers 
[3, 49]. Today, the numerical evidence in support of both the weak and full Birch- 
Swinnerton-Dyer conjecture is overwhelming, and probably more extensive than 
for any other conjecture in the history of mathematics. Access to the vast amount 
of numerical data, which, to date, confirms experimentally every aspect of the 
conjecture, can be made at the website www.lmfdb.org/EllipticCurve/Q (see also 
the earlier book [15], which is available online on John Cremona’s homepage at 
Warwick University). This website includes tables of all elliptic curves EF over 
Q with conductor C(E) < 360,000. There are 2,247,187 elliptic curves in this 
table, lying in 1,569,126 Q-isogeny classes. All such curves have gg < 4, and in 
fact there is only one curve in the table with gg = 4 (this curve has conductor 
234,446). In addition, Miller, Stoll, and Creutz [14, 35, 36] have verified the full 
Birch-Swinnerton-Dyer conjecture for all E defined over Q with C(E) < 5000, 
which have rg < 1. The analytic quantity £¢ can be computed numerically to great 
accuracy irrespective of the value of rg, and the same is usually true for all quantities 
occurring in the exact formula (3.3), except for the order of the Tate-Shafarevich 
group of E. Even when ILI (E/Q) is known to be finite, it is very difficult to actually 
compute its true order arithmetically. However, even granted this difficulty, there is 
one subtle sub-test of the order of H1(E/Q) as predicted by (3.3) being correct. As 
we shall explain in the next section, an important theorem of Cassels [9] proves that 
if I11(E/Q) is finite, then its order must be the square of an integer. Happily, in all 
of the vast number of numerical examples computed to date, the formula (3.3) has 
always produced a conjectural order for the Tate-Shafarevich group which is indeed 
the square of an integer. 
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4 Parity Questions 


The only deep general results known about the conjecture of Birch and Swinnerton- 
Dyer, which do not involve in some fashion the hypothesis that the L-series of the 
curve has a zero at s = | of order at most 1, are parity theorems of two kinds. As 
always, E will be an elliptic curve defined over Q, and II(£/Q) will denote its 
Tate-Shafarevich group. If A is any abelian group and p a prime number, we write 
A(p) for the p-primary subgroup of A, and A[p] for the kernel of multiplication by 
p onA. Also, Agiy will denote the maximal divisible subgroup of A. The following 
theorem is due to Cassels [9] and Tate [52]. 


Theorem 4.1. There is a canonical non-degenerate, alternating, bilinear form on 


TI(E/Q)/MI(E/Qhaiv- 


Corollary 4.2. For every prime p, the F,-vector space given by the kernel of 
multiplication by p on WI(E/Q)/U1(E/Q)aiv has even dimension. 


Corollary 4.3. If p is a prime such that U(E/Q)(p)av = 0, then the order of 
IN (E/Q)(p) is a square. 


In particular, if II(E£/Q) is finite, its order must be a perfect square. We note 
that, for every prime p, classical Galois cohomology shows that, for some integer 
te» = 0, one has 


IN(E/Q)(p) = (Q,/Z,p)*” @ JEp; 


where Jz, is a finite group. Plainly INI(E/Q)(p)a» = (Q,/Z,)*". Of course, 
conjecturally tg, = O for every prime p, but the only far weaker general known 
result in this direction is the following parity theorem of the Dokchitser brothers 
[16]. Recall that g_ denotes the rank of E(Q), and rg denotes the order of zero of 
L(E, s) at the point s = 1. 


Theorem 4.4. For every prime number p, we have rg = gr + tgp mod 2. In 
particular, the parity of tz) is independent of p. 


As a simple application of this theorem, we see that if there did exist a square free 
positive integer N with N = 5,6, 7mod 8, which is not a congruent number, then 
the p-primary subgroup of the Tate-Shafarevich of the elliptic curve (1.1) would 
have to contain a copy of the divisible group Q,/Z, for every prime p, and so a 
copy of Q/Z. We also note that the strong parity conjecture is the assertion that 
rg = gemod 2, but this has only been proven at present under the assumption that 
rg < 1, when, as we shall see in the next section, we even have rg = gz, in accord 
with the weak Birch-Swinnerton-Dyer conjecture. 
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5 Main Results 


As before, let E denote any elliptic curve defined over Q. Define the modular curve 
Xo(C(E)) by 


Xo(C(E)) = To(C(E)) \(HUP'@), 


where P!(Q) denotes the projective line over Q. Then Xo(C(E)) is the set of 
complex points of a curve defined over Q, which we also denote by Xo(C(E)). 
Let [co ] denote the cusp oo of this curve. The modularity Theorem 2.1 of Wiles 
[5, 57], when combined with work of Shimura [46], shows that there exists an 
elliptic curve E(f;) defined over Q, which is a factor up to isogeny of the Jacobian 
variety of Xo(C(E)), and has the same L-series as the elliptic curve E. Hence, by 
Faltings theorem [18], E and E(f;) must be isogenous over Q, whence we obtain 
the following result. 


Theorem 5.1. There is a non-constant rational map defined over Q 
(5.1) @:Xi(C(E)) > E 


with @( [oo }) = O. 


The most important result to date in the direction of the conjecture of Birch 
and Swinnerton-Dyer is the following theorem of Kolyvagin and Gross-Zagier (see 
[23]). We again write rg and gz for the order of the zero of the complex L-series of 
E at s = 1, and for the rank of E(Q). 


Theorem 5.2. [fre < 1, then rg = gp, and W(E/Q) is finite. 


The proof relies heavily on the earlier work of Gross-Zagier [24], relating the 
canonical height of Heegner points to derivatives of L-functions, as well as on 
Kolyvagin’s highly original notion of an Euler system [27]. Heegner points were 
first discovered, in a special case, by Heegner in his celebrated paper [25], and it 
was Birch and Stephens who first conjectured that they should be related to the 
derivatives of L-series. We also note that the proof of the above theorem establishes 
the following rationality result. 


Theorem 5.3. If rg < 1, then Sg/(QeERoo(E)) lies in Q. 


When rg = 0, we know by Theorem 5.2 that gg = 0, whence R.(E) = 
1/#(E(Q))?, and Theorem 5.3 in this case is just a consequence of the classical 
theory of modular symbols going back to the work of Hecke and others (see 
[15]). However, when rg = 1, so that gg = 1 by Theorem 5.2, the assertion of 
Theorem 5.3 can only be proven by using the Gross-Zagier theorem [24]. Moreover, 
we Stress that, contrary to what is often stated in the literature, we still cannot prove 
the Birch-Swinnerton-Dyer conjectural exact formula for the order of HI(E/Q) 
under the hypothesis that rg < 1. Finally, we note that some special cases of 
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Theorem 5.2 were proven earlier for elliptic curves with complex multiplication 
by rather different methods, which make use of elliptic units rather than Heegner 
points (see [12, 42-44]). In another direction, Zhang [59] has generalised the above 
results to the Jacobian varieties of Shimura curves defined over totally real number 
fields. 

Theorem 5.2 has a surprising application to Gauss’ class number problem for 
imaginary quadratic fields, as was discovered by Goldfeld [20] (see also [38]). If E 
is an elliptic curve with gz > 3, Theorem 5.2 guarantees that necessarily rg > 1, and 
so we must have that rg > 3 when wg = —1. It is usually easy to check numerically 
that rg < 3 when gg = 3, and thus one can prove that there exist elliptic curves E 
over Q with rg = 3. For example, rg = 3 for the curve 


E:y+y=x —7x43, 


which has conductor C(E) = 5077, and gg = 3. Goldfeld’s work then enables 
one to give, for the first time, an explicit upper bound for the absolute value of the 
discriminants of all imaginary quadratic fields having any prescribed class number. 
The earlier work on this problem by Heilbronn and Siegel, while proving that the 
class number of an imaginary quadratic field tends to infinity with the absolute value 
of the discriminant of the field, was ineffective. 

In view of Theorem 5.2, it is plainly an important problem to decide when the 
hypothesis rg < 1 holds. In any particular numerical example, it is usually easy 
to settle this question, but our knowledge of theoretical results is still very limited. 
The most natural example of infinite families of elliptic curves defined over Q is 
given by the family of quadratic twists of a fixed elliptic curve E. If M is the 
discriminant of a quadratic field, E“ will denote the quadratic twist of E by the 
extension Q(./M)/Q (in other words, E™ is the unique elliptic curve defined over 
Q, which is not isomorphic to E over Q, but becomes isomorphic to E over Q(/M)). 
It is not difficult to see that, in the family of all quadratic twists E™ of a given E 
defined over Q, the root numbers wea = +1 and wea = —1 will each occur 
half the time. A folklore conjecture (see [21]) asserts that amongst those quadratic 
twists E™ with wean = +1 (respectively, with wean = —1), we should have 
rp@ = O (respectively, rea) = 1) outside a set of discriminants M of density 
zero, but this has never been proven for a single elliptic curve E. However, the 
papers [6] and [37] prove, by rather different methods, the important result that 
there always exist infinitely many discriminants M such that rg = 0, and infinitely 
many discriminants M such that rea) = 1. From the point of view of diophantine 
equations, there is great interest in establishing conditions on the prime factors of M 
which guarantee that rpun = 1, since E™) (Q) is infinite for such M by Theorem 5.2. 
The first result in this direction is due to Heegner [25], and subsequently Birch [4] 
generalised and reformulated it. We write [0 ] for the zero cusp on the modular 
curve Xo(C(E)). 


Theorem 5.4. Let E/Q be any elliptic such that $( [0 ]) is not contained in 2E(Q), 
and let p be any prime number such that p = 3mod4, and every prime dividing 
C(E) splits in the imaginary quadratic field Q(,/—p). Then rg—» = 1, and so, in 
particular, E (Q) is infinite. 
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Recently, Tian discovered a method for generalising this result to quadratic twists 
by discriminants having an arbitrary prescribed number of prime factors, which he 
first applied to the classical congruent number problem [55]. More generally, his 
method yields the following theorem [13]. 


Theorem 5.5. Let E/Q be any elliptic curve such that (i) $( [0 ]) is not contained 
in 2E(Q), and (ii) there exists a good supersingular prime q with q = \1mod4, 
and with C(E) a square modulo q. Then, for each integer k > 1, there exist 
infinitely many square free discriminants M having exactly k prime factors, and 
with (M, C(E)) = 1, such that rg = 1, whence, in particular, E™ (Q) is infinite. 


Finally, we mention a recent result proven by Bertolini, Darmon, and Rotger 
[1], which is a key first step towards generalizing Theorem 5.2 to certain finite 
Galois extensions of Q. Let p denote an odd, irreducible, 2-dimensional Artin 
representation of the absolute Galois group of Q. As usual, we define L(E, p, s) 
to be the Euler product attached to the tensor product of the Artin representation p 
with the /-adic Tate module of E. Since both E and p are known to be modular, it 
follows from the theory of modular forms that L(E, p, s) is also entire. 


Theorem 5.6. [f L(E, 9,1) 4 0, then p does not occur in E(F) ®z C, where F is 
the fixed field of the kernel of p. 


Moreover, very recent work of Kings, Lei, Loeffler and Zerbes [29-31] constructs a 
new Euler system, and they use it both to give another proof of Theorem 5.6, and, 
in addition, to show that, when L(E, p, 1) 4 0, the p-component of the p-primary 
subgroup of the Tate-Shafarevich group of E over F is finite for most primes p. 


6 The Exact Formula Prime by Prime 


In this section, we will discuss the partial results which are known about the con- 
jectural exact Birch-Swinnerton-Dyer formula for the order of the Tate-Shafarevich 
group of an elliptic curve E/Q when we assume that rg < 1. Theorem 5.2 assures us 
that, in this case, E(Q) has rank rg, and III(E/Q) is finite. We note that the order of 
the torsion subgroup E(Q) is easily determined, and has only one of 12 possibilities, 
thanks to the beautiful work of Mazur [32]. However, the classical theory of descent 
does not give a practical way to compute the order of the p-primary part of II(E/Q) 
once p > 5, and the only techniques which work at present are p-adic methods 
related to Iwasawa theory. In view of this, it is convenient to break the conjecture 
up into a p-part for every prime number p. To simplify the notation, we define 


Tam(E) = Coo (E) I] Cq(E). 


q\C(E) 
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It is also convenient to put 
L@®(E,1) = LE, 1)/Qz, 


which we know lies in Q by Theorem 5.3. Then, for every prime number p, the 
strong Birch-Swinnerton-Dyer conjecture predicts the following exact formula for 
the order of the p-primary subgroup II (£/Q)(p) of HI(E/Q) when rg = 0. 


p-Part of the Birch-Swinnerton-Dyer Conjecture for Analytic Rank 0. Assume 
that rg = 0. Then, for each prime p, we have 


(6.1) 
ord, (#(LL(E/Q)(p))) = ord,(L“® (E, 1)) + 2ord,(#(E(Q))) — ordy(Tam(E)). 


The strongest general result known about this p-part of the Birch and Swinnerton- 
Dyer conjecture is for E with complex multiplication, and is proven using the Euler 
system of elliptic units, combined with arguments from Iwasawa theory. The result 
is due to Rubin [42], but also uses earlier work of Yager [58]. 


Theorem 6.1. Assume that rg = 0, and that E admits complex multiplication by an 
order in an imaginary quadratic field K. If p is any prime which does not divide the 
order of the group of roots of unity of K, then the p-part of the Birch-Swinnerton- 
Dyer conjecture is valid for E. 


As an example of this theorem, consider the modular curve A = X0(49), which is 
an elliptic curve with equation 


Ary’ tay =x —2? —2x-1. 


Take E = A™), with M = q,...q,, where the q; are distinct primes, with 
gi = 1mod4 and q; = 3,5,6mod7, fori = 1,...,r. It is shown in [13] that 
ord,(L“®) (E, 1)) = r — 1 for all r > 0. This proves that L(E, 1) 4 0, and it is easy 
to see that it establishes the 2-part of the Birch-Swinnerton-Dyer conjecture for E. 
Hence, applying Theorem 6.1, we conclude that gz = 0, UI(E/Q) is finite, and the 
exact Birch-Swinnerton-Dyer formula is valid for the order of HI (E/Q). 

For elliptic curves without complex multiplication, the only way of attacking 
the p-part of the conjecture of Birch and Swinnerton-Dyer when rg = 0 is by 
considering the Iwasawa theory of E over the cyclotomic Z,-extension of Q. 
We srecall that, for p any prime, the cyclotomic Z,-extension of Q, which we denote 
by ®,, is the unique subfield of the field generated over Q by all p-power roots of 
unity, whose Galois group I’ over Q is isomorphic to the additive group of Zp. 
Mazur and Swinnerton-Dyer [36] were the first to prove the existence of a p-adic L- 
function attached to E over ®, when p is a prime of good ordinary reduction for E, 
and to formulate a “main conjecture” relating this p-adic L-function to the [-module 
given by the p~-Selmer group of E over ®gy. As a special case of a more general 
result, Schneider [45] showed that, for p any odd prime number where E has good 
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ordinary reduction, this “main conjecture” would indeed imply the p-part of the 
Birch-Swinnerton-Dyer conjecture for the order of I(E/Q) when rg = 0. The first 
major breakthrough in the direction of proving this main conjecture for odd good 
ordinary primes p was made by Kato [26]. He proved the existence of a remarkable 
new Euler system attached to E, and used it to prove a partial result in the direction 
of the “main conjecture” for sufficiently large good ordinary primes p. Subsequently, 
Skinner and Urban [48] have completed the proof of this “main conjecture” in many 
cases, by combining Kato’s result with deep arguments from the theory of modular 
forms. This leads to the following specific theorem [48] about the p-part of the 
conjecture of Birch and Swinnerton-Dyer. Let E, denote the Galois module of p- 
division points on E, and let 


Ve p : Gal(Q/Q) > Aut(E,) = GLa(F,) 


be the associated Galois representation. 


Theorem 6.2. Assume that E does not admit complex multiplication, and that 
rg = 0. Let p be any prime number such that (1) p # 2, (2) E has good ordinary 
reduction at p, (3) Ve is surjective, and (4) there exists a prime q, where E has bad 
multiplicative reduction, such that the Galois module E, is ramified at q. Then the 
p-part of the Birch-Swinnerton-Dyer conjecture is valid for E. 


In particular, if E is semistable (i.e. E has multiplicative reduction at all primes of 
bad reduction), and rg = 0, then this theorem establishes the p-part of the Birch- 
Swinnerton-Dyer conjecture for all primes p > 11 of good ordinary reduction. For 
primes p > 2 where E has good supersingular reduction and t, = 0, Wan [56] 
uses quite different methods in Iwasawa theory to give a proof of the p-part of the 
conjecture of Birch and Swinnerton-Dyer, assuming that rz = 0 and F is semistable. 

One can also formulate the p-part of the conjecture of Birch and Swinnerton- 
Dyer for every prime p when rg = 1. 


p-Part of the Birch-Swinnerton-Dyer Conjecture for Analytic Rank 1. Assume 
that rg = 1. Then, for each prime p, we have 


(6.2) ord, #(II(E/Q)(p))) = ordp(Lz/(2ERoo(E))) — ord, (Tam(E)). 


When E admits complex multiplication and rg = 1, the work of Kobayashi [28], 
Perrin-Riou [40], Pollak-Rubin [41], Rubin [42] establishes the following analogue 
of Theorem 6.1. 


Theorem 6.3. Assume that E admits complex multiplication and that rg = 1. Let 
p be any odd prime where E has good reduction. Then the p-part of the Birch- 
Swinnerton-Dyer conjecture is valid for E. 


When E does not admit complex multiplication, we have the following recent 
theorem of Zhang [60]. 
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Theorem 6.4. Assume that E does not admit complex multiplication, and that 
rg = 1. Let p be any prime number such that (1) p = 5, (2) E has good ordinary 
reduction at p, (3) Vg p is surjective, (4) there exist two primes qj(i = 1,2) of bad 
multiplicative reduction for E such that the Galois module E, is ramified at both 
qi and qz, and (5) If q is any prime of bad multiplicative reduction for E with 
q = +1modp, then E, is ramified at q. Then the p-part of the Birch-Swinnerton- 
Dyer conjecture is valid for E. 


Finally, it is also interesting to note that Kato’s work [27], combined with 
a theorem of Rohrlich [39], proves the following result, which was originally 
conjectured by Mazur [33]. Let 1,00 denote the group of all p-power roots of unity. 


Theorem 6.5. For all primes p, the abelian group E(Q({Lpe)) is finitely generated. 


7 A Numerical Example 


Although this article is not directly concerned with the conjecture of Birch and 
Swinnerton-Dyer for elliptic curves over number fields, I want to end by briefly 
explaining a remarkable and naturally occurring numerical example, related to the 
elliptic curves of conductor 11. We recall that 11 is the smallest conductor for an 
elliptic curve defined over Q, and there are three isogenous curves of conductor 11 
defined over Q. Two of these curves are given by 


Aiyty=x2%—-2, Airy +y =x — x7 — 7820x — 263580, 


and they are linked by a Q-isogeny w : Az — A, of degree 25. It is well known that 
A\(Q) = Z/5Z, and A2(Q) = 0. As was pointed out to me by Fisher and Matsuno, 
the splitting field of the Galois representation given by the kernel of w, which we 
denote by J, is the field Q(js, 7), where jus is the group of fifth roots of unity, and r 
denotes any root of the abelian polynomial 


2 — 65x" + 205x7 + 140x7 + 25x41. 


Around the year 2000, Matsuno discovered that the complex L-series of either of 
these two curves, when viewed as curves over J, has a zero of order 4 at s = 1. It 
therefore became an interesting numerical challenge to show that the group of points 
of either of these two curves with coordinates in J also has rank 4, as predicted by the 
natural generalization of the weak Birch-Swinnerton-Dyer conjecture. Recently, S. 
Donnelly (private communication) finally found four linearly independent points, 
using the MAGMA system in Sydney University. I am very grateful to him for 
providing the following data. Define E = A> to be the quadratic twist of A; by 
Q(V5), so that C(E) = 275. An equation for the curve E is given by 
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y? = x* — 10800x + 1026000. 
Then Donnelly discovered that there is a point in E(/) with x-coordinate 
(7.1) (1632096r* — 10653364877 + 3636966967 + 134074044r + 8312592) /41323 


and the conjugates of this point under the Galois group of Q(r)/Q span a subgroup 
of rank 4 in E(Q(r)). Moreover, the torsion subgroup of E(Q(r)) is trivial, and 
it is very probable that E(Q(r)) is generated by any four of the conjugates of 
the point (7.1). Also, the Birch-Swinnerton-Dyer conjecture predicts that the Tate- 
Shafarevich group of E over Q(r) should be trivial. Note that E(J) = A, (J) because 
V5 € J. It also seems very likely that A,(J) is generated by any four of the 
conjugates of the point (7.1), together with the point (0, 0) of order 5. Obviously, J 
is a subfield of the field F,, which is obtained by adjoining to Q the coordinates of 
all 5-power division points on any of the three curves of conductor 11. At present, 
these points found by Donnelly are the only known points of infinite order on the 
curves of conductor 11 with coordinates in Fy (see [11]). 
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An Essay on the Riemann Hypothesis 


Alain Connes 


Abstract The Riemann hypothesis is, and will hopefully remain for a long time, 
a great motivation to uncover and explore new parts of the mathematical world. 
After reviewing its impact on the development of algebraic geometry we discuss 
three strategies, working concretely at the level of the explicit formulas. The first 
strategy is “analytic” and is based on Riemannian spaces and Selberg’s work on the 
trace formula and its comparison with the explicit formulas. The second is based on 
algebraic geometry and the Riemann-Roch theorem. We establish a framework in 
which one can transpose many of the ingredients of the Weil proof as reformulated 
by Mattuck, Tate and Grothendieck. This framework is elaborate and involves 
noncommutative geometry, Grothendieck toposes and tropical geometry. We point 
out the remaining difficulties and show that RH gives a strong motivation to develop 
algebraic geometry in the emerging world of characteristic one. Finally we briefly 
discuss a third strategy based on the development of a suitable “Weil cohomology”, 
the role of Segal’s I’-rings and of topological cyclic homology as a model for 
“absolute algebra” and as a cohomological tool. 


1 Introduction 


Let (x) := #{p | p € Y, p < x} be the number of primes less than x with 5 added 
when x is prime. Riemann [85] found for the counting function! 


fo) =D nte'), 


‘Similar counting functions were already present in Chebyshev’s work. 
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the following formula involving the integral logarithm function Li(x) = i ae 
f(x) = Li(x) — Y> Lie’) + i ee! log2 (1) 
a - y @—1 tlogt e 


in terms~ of the non-trivial zeros p of the analytic continuation (shown as well as 
two proofs of the functional equation by Riemann at the beginning of his paper) of 
the Euler zeta function 


(9) = = 


Reading Riemann’s original paper is surely still the best initiation to the subject. 
In his lecture given in Seattle in August 1996, on the occasion of the 100th 
anniversary of the proof of the prime number theorem, Atle Selberg comments about 
Riemann’s paper: [88] 

It is clearly a preliminary note and might not have been written if L. Kronecker had not 

urged him to write up something about this work (letter to Weierstrass, Oct. 26 1859). It 

is clear that there are holes that need to be filled in, but also clear that he had a lot more 


material than what is in the note.*> What also seems clear : Riemann is not interested in an 
asymptotic formula, not in the prime number theorem, what he is after is an exact formula! 


The Riemann hypothesis (RH) states that all the non-trivial zeros of ¢ are on the 
line 5 + iR. This hypothesis has become over the years and the many unsuccessful 
attempts at proving it, a kind of “Holy Grail” of mathematics. Its validity is indeed 
one of the deepest conjectures and besides its clear inference on the distribution of 
prime numbers, it admits relations with many parts of pure mathematics as well as 
of quantum physics. 

It is, and will hopefully remain for a long time, a great motivation to uncover and 
explore new parts of the mathematical world. There are many excellent texts on RH, 
such as [10] which explain in great detail what is known about the problem, and the 
many implications of a positive answer to the conjecture. When asked by John Nash 
to write a text on RH," I realized that writing one more encyclopedic text would 
just add another layer to the psychological barrier that surrounds RH. Thus I have 
chosen deliberately to adopt another point of view, which is to navigate between 
the many forms of the explicit formulas (of which (1) is the prime example) and 
possible strategies to attack the problem, stressing the value of the elaboration of 
new concepts rather than “problem solving”. 


?More precisely Riemann writes (a)so (Lig +) ar Li(x!-*)) instead of }°, Li(x?) using 
the symmetry p — | — 9 provided by the functional equation, to perform the summation. 

3See [44, Chap. VII] for detailed support to Selberg’s comment. 

4My warmest thanks to Michael Th. Rassias for the communication. 
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¢ RH and algebraic geometry 
We first explain the Riemann-Weil explicit formulas in the framework of adeles 
and global fields in Sect. 2.1. We then sketch in Sect. 2.3 the geometric proof of 
RH for function fields as done by Weil, Mattuck, Tate and Grothendieck. We then 
turn to the role of RH in generating new mathematics, its role in the evolution of 
algebraic geometry in the twentieth century through the Weil conjectures, proved 
by Deligne, and the elaboration by Grothendieck of the notions of scheme and 
of topos. 

¢ Riemannian Geometry, Spectra and trace formulas 
Besides the proof of analogues of RH such as the results of Weil and of Deligne, 
there is another family of results that come pretty close. They give another 
natural approach of RH using analysis, based on the pioneering work of Selberg 
on trace formulas. These will be reviewed in Sect. 3 where the difficulty arising 
from the minus sign in front of the oscillatory terms will be addressed. 

¢ The Riemann-Roch strategy: A Geometric Framework 
In Sect. 4, we shall describe a geometric framework, established in our joint work 
with C. Consani, allowing us to transpose several of the key ingredients of the 
geometric proof of RH for function fields recalled in Sect. 2.3. It is yet unclear if 
this is the right set-up for the final Riemann-Roch step, but it will illustrate the 
power of RH as an incentive to explore new parts of mathematics since it gives a 
clear motivation for developing algebraic geometry in characteristic 1 along the 
line of tropical geometry. This will take us from the world of characteristic p to 
the world of characteristic 1, and give us an opportunity to describe its relation 
with semi-classical and idempotent analysis, optimization and game theory,> 
through the Riemann-Roch theorem in tropical geometry [3, 48, 82]. 

¢ Absolute Algebra and the sphere spectrum 
The arithmetic and scaling sites which are the geometric spaces underlying 
the Riemann-Roch strategy of Sect.4 are only the semiclassical shadows of a 
more mysterious structure underlying the compactification of Spec Z that should 
give a cohomological interpretation of the explicit formulas. We describe in this 
last section an essential tool coming from algebraic topology: Segal’s I'-rings 
and the sphere spectrum, over which all previous attempts at developing an 
absolute algebra organize themselves. Moreover, thanks to the results of Hessel- 
holt and Madsen in particular, topological cyclic homology gives a cohomology 
theory suitable to treat in a unified manner the local factors of L-functions. 


2 RH and Algebraic Geometry 


I will briefly sketch here the way RH, once transposed in finite characteristic, 
has played a determining role in the upheaval of the very notion of geometric 
space in algebraic geometry culminating with the notions of scheme and topos 


One of the topics in which John Nash made fundamental contributions. 
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due to Grothendieck, with the notion of topos offering a frame of thoughts of 
incomparable generality and breadth. It is a quite remarkable testimony to the unity 
of mathematics that the origin of this discovery lies in the greatest problem of 
analysis and arithmetic. 


2.1 The Riemann- Weil Explicit Formulas, Adeles 
and Global Fields 


Riemann’s formula (1) is a special case of the “explicit formulas” which establish 
a duality between the primes and the zeros of zeta. This formula has been extended 
by Weil in the context of global fields which provides a perfect framework for a 
generalization of RH since it has been solved, by Weil, for all global fields except 
number fields. 


2.1.1 The Case of ¢ 

Let us start with the explicit formulas (cf. [9, 59, 83, 101, 103]). We start with a 
function F(u) defined for u € [1, 00), continuous and continuously differentiable 
except for finitely many points at which both F(u) and F’(u) have at most a 


discontinuity of the first kind,° and such that, for some € > 0, F(u) = O(u7!/2~*). 
One then defines the Mellin transform of F as 


@(s) = / * E(u) udu (2) 
1 


The explicit formula then takes the form 


1 1 1 = 
eee) 2 26-9 = 2D ere PG) 


p€Zeros p m=1 


dt 


y  logx 0° 3/2 F(t) — F(1) 
se F(A an SS ais 
G+ 2 ) a+ f t(t? — 1) 
where y = —I’’(1) is the Euler constant, and the zeros are counted with their 
multiplicities i.e. >) ,ezer0s (Pp — 5) means )> peZeros OFder(p) P(p — 5). 


6 And at which the value of F(u) is defined as the average of the right and left limits there. 
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2.1.2 Adeles and Global Fields 


By a result of Iwasawa [66] a field K is a algebraic number field, or an algebraic 
function field of one variable over a finite constant field, if and only if there 
exists a semi-simple (i.e. with trivial Jacobson radical [68]) commutative ring R 
containing K such that R is locally compact, but neither compact nor discrete and K 
is discrete and cocompact in R. This result gives a conceptual definition of what is 
a “global field” and indicates that the arithmetic of such fields is intimately related 
to analysis on the parent ring R which is called the ring of adeles of K [93, 102]. 
It is the opening door to a whole world which is that of automorphic forms and 
representations, starting in the case of GL; with Tate’s thesis [93] and Weil’s book 
[102]. Given a global field K, the ring Ax of adeles of K is the restricted product 
of the locally compact fields K, obtained as completions of K for the different 
places v of K. The equality dax = |a|dx for the additive Haar measure defines the 
module Mod : K, — R+, Mod(a) := |a| on the local fields K, and also as a group 
homomorphism Mod : Cx — R%4 where Cx = GL;(Ax)/K” is the idele class 
group. The kernel of the module is a compact subgroup Cx; C Cx and the range 
of the module is a cocompact subgroup Mod(IK) C R%. On any locally compact 
modulated group, such as Cx or the multiplicative groups K*, one normalizes the 
Haar measure d*u uniquely so that the measure of {u | 1 < |u| < A} is equivalent 
to log A when A — ov. 


2.1.3 Weil’s Explicit Formulas 


As shown by Weil, in [103], adeles and global fields give the natural framework for 
the explicit formulas. For each character y € Cx one chooses an extension 7 to Cx 
and one lets Z; be the set (with multiplicities and taken modulo the orthogonal of 
Mod(R), i.e. {s € C | g’ = 1, Vg € Mod(K)}) of zeros of the L-function associated 
to x. Let then a be a nontrivial character of Ax/K and a = || ay its local factors. 
The explicit formulas take the following form, with h € (Cx) a Schwartz function 
with compact support: 


7 1 
io + Diz = ‘a — re (4) 


xeCu 1 Zi 


where the principal value f. is normalized by the additive character a, (cf. [22, 
Chap. II, Sect. 8.5, Theorem 2.44] for the precise notations and normalizations) and 
for any character w of Cx one lets 


h(w,z) = [row lu at A(t) := h(, 0) (5) 
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For later use in Sect. 4.1 we compare (3) with the Weil way (4) of writing the explicit 
formulas. Let the function h be the function on Cg given by h(u) := u|-2 F (ul) 
(with F(v) = 0 for v < 1). Then h(w,z) = 0 for characters with non-trivial 
restriction to Cg; = Z*, while h(1,z) = ®(z — 5). Moreover note that for the 
archimedean place v of K = Q one has, disregarding the principal values for 


simplicity, 
h(u"! h 
ae ws 
Ks [1 — u| r* |1— uo" 


ee a 1 1 dt er al alc 63), 
=x h(t) + = dt 
2S fl—r | jl+er |) t 1 t(t-1) 


where the 5 comes from the normalization of the multiplicative Haar measure 


of R* viewed as a modulated group. In a similar way, the normalization of the 
multiplicative Haar measure on Q shows that for the finite place associated to 


the prime p one gets the term )~°_, logp p CPF (p”. 


2.2 RH for Function Fields 


When the module Mod(K) of a global field is a discrete subgroup of R%, it is of the 
form Mod(K) = q@ where q is a prime power, and the field K is the function field 
of a smooth projective curve C over the finite field Fy. 

Already at the beginning of the twentieth century, Emil Artin and Friedrich Karl 
Schmidt have generalized RH to the case of function fields. We refer to the text of 
Cartier [16] where he explains how Weil’s definition of the zeta function associated 
to a variety over a finite field slowly emerged, starting with the thesis of E. Artin 
where this zeta function was defined for quadratic extensions of F,[T], explaining 
F. K. Schmidt’s generalization to finite extensions of F,[T] and the work of Hasse 
on the Riemann hypothesis for elliptic curves over finite fields. 

When the global field K is a function field, geometry comes to the rescue. The 
problem becomes intimately related to the geometric one of estimating the number 
N(q’) := #C(F,,) of points of C rational over a finite extension F, of the field of 
definition of C. The analogue of the Riemann zeta function is a generating function: 
the Hasse-Weil zeta function 


Eels) = Z(C.q™), Z(C,T) := exp Mg) — (6) 


rl 


The analogue of RH for €c was proved by André Weil in 1940. Pressed by the 
circumstances (he was detained in jail) he sent a Comptes-Rendus note to E. Cartan 
announcing his result. Friedrich Karl Schmidt and Helmut Hasse had previously 
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been able to transpose the Riemann-Roch theorem in the framework of geometry 
over finite fields and shown its implications for the zeta function: it is a rational 
fraction (of the variable T) and it satisfies a functional equation. But it took André 
Weil several years to put on solid ground a general theory of algebraic geometry 
in finite characteristic that would justify his geometric arguments and allow him 
to transpose the Hodge index theorem in the form due to the Italian geometers 
Francesco Severi and Guido Castelnuovo at the beginning of the twentieth century. 


2.3 The Proof Using Riemann-Roch on C x C 


Let C be a smooth projective curve over the finite field F,. The first step is to extend 
the scalars from F, to an algebraic closure F',. Thus one lets 


€:= Ces F, (7) 


This operation of extension of scalars does not change the points over Fy, i le. one 
has C (F,) = Cc (F,). The Galois action of the Frobenius automorphism of Fy raises 
the coordinates of any point x € C (F, ) to the gth power and this dandomnalion 
of C(F,) coincides with the relative Frobenius Fr, := Frc x Id of C, where Frc 
is the absolute Frobenius of C (which is the identity on points of the scheme and 
the qth power map in the structure sheaf). The relative Frobenius Fr, is F, -linear by 
construction and one can consider its graph in the surface X = C Xf, C which is 


the square of C. This graph is the Frobenius correspondence W. It is important to 
work over an algebraically closed field in order to have a good intersection theory. 
This allows one to express the right hand side of the explicit formula (4) for the 
zeta function ¢c as an intersection number D.A, where A is the diagonal in the 
square and D = >> a,W* is the divisor given by a finite integral linear combination 
of powers of the Frobenius correspondence. The terms h(0), hi 1) in the explicit 
formula are also given by intersection numbers D.&;, where 


f& =e9xC, & =Cxe (8) 


where the e; are points of C. One then considers divisors on X up to the additive 
subgroup of principal divisors i.e. those corresponding to an element f € .% of 
the function field of X. The problem is then reduced to proving the negativity of 
D.D (the self-intersection pairing) for divisors of degree zero. The Riemann-Roch 
theorem on the surface X gives the answer. To each divisor D on X corresponds 
an index problem and one has a finite dimensional vector space of solutions 
H°(X, O(D)) over F,. Let 


£(D) = dim H°(X, G(D)) (9) 
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The best way to think of the sheaf @(D) is in terms of Cartier divisors, i.e. a global 
section of the quotient sheaf %*/ OG, where .% is the constant sheaf corresponding 
to the function field of X and @x is the structure sheaf. The sheaf @(D) associated 
to a Cartier divisor is obtained by taking the sub-sheaf of % whose sections on U; 
form the sub @y-module generated by f~'! € I'(U;,.%*) where the f; represent D 
locally. One has a “canonical” divisor K and Serre duality 


dim H?(X, G(D)) = dim H°(X, @(K — D)) (10) 
Moreover the following Riemann-Roch formula holds 


2 
Y\(-Dydim H1(X, O(D)) = 5D.(D—K) + 7(X) (11) 
0 


where y(X) is the arithmetic genus. All this yields the Riemann-Roch inequality 
1 
€(D) + &(K — D) = 3D.D— K) + x(X) (12) 


One then applies Lemma 1 to the quadratic form s(D, D’) = D.D’ using the &; of (8). 
One needs three basic facts [54] 


1. If £(D) > 1 then D is equivalent to a strictly positive divisor. 
2. If Dis a strictly positive divisor then 


D.&) + D.E; > 0 


3. One has &.€; = 1 and §.& = 0. 


One then uses (12) to show (see [54]) that if D.D > 0 then after a suitable rescaling 
by n > O orn < 0 one gets (nD) > 1 which shows that the hypothesis (2) of the 
following simple Lemma | is fulfilled, and hence that RH holds for c, 


Lemma 1. Let s(x, y) be a symmetric bilinear form on a vector space E (over Q 
or R). Let & € E, j € {0, 1}, be such that 


1. 8(§, &) => 0 and 5(&, &) = 1, 
2. For any x € E such that s(x,x) > 0 one has 5(x, &) 4 0 or s(x, &) 4 0. 


Then one has the inequality 
5(x,x) < 28(x, fo), 61), WrE E (13) 


The proof takes one line but the meaning of this lemma is to reconcile the “naive 
positivity” of the right hand side of the explicit formula (4) (which is positive when 
h = 0 vanishes near u = 1) with the negativity of the left hand side needed to prove 
RH (cf. Sect. 3.1 (17) below). 
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At this point we see that it is highly desirable to find a geometric framework 
for the Riemann zeta function itself, in which the Hasse-Weil formula (6), the 
geometric interpretation of the explicit formulas, the Frobenius correspon- 
dences, the divisors, principal divisors, Riemann-Roch problem on the curve 
and the square of the curve all make sense. 


Such a tentative framework will be explained in Sect. 4. It involves in partic- 
ular the refinement of the notion of geometric space which was uncovered by 
Grothendieck and to which we now briefly turn. 


2.4 Grothendieck and the Notion of Topos 


The essential ingredients of the proof explained in Sect.2.3 are the intersection 
theory for divisors on C x C, sheaf cohomology and Serre duality, which give 
the formulation of the Riemann-Roch theorem. Both owe to the discovery of sheaf 
theory by J. Leray and the pioneering work of J. P. Serre on the use of sheaves 
for the Zariski topology in the algebraic context, with his fundamental theorem 
comparing the algebraic and analytic frameworks. The next revolution came from 
the elaboration by A. Grothendieck and M. Artin of etale £-adic cohomology. 
It allows one to express the Weil zeta function of a smooth projective variety X 
defined over a finite field F, i.e. the function Z(X, 1) given by (6) witht = gq“ 
which continues to make sense in general, as an alternate product of the form 


2 dimX 
Z(X.1) = [] det(1—1F* | (Xe, QQ) OY (14) 
j=0 


where F* corresponds to the action of the Frobenius on the £-adic cohomology and 
£ is a prime which is prime to g. This equality follows from a Lefschetz formula for 
the number N(q’) of fixed points of the rth power of the Frobenius and when X = C 
is a curve the explicit formulas reduce to the Lefschetz formula. The construction 
of the cohomology groups H/(X.,Qv) is indirect and they are defined as : 


FF (Xer, Qe) = lim (H! (Xe, Z/€"Z)) @z, Qe 


n 


where Xet is the etale site of X. Recently the etale site of a scheme has been refined 
[4] to the pro-etale site whose objects no longer satisfy any finiteness condition. 
The cohomology groups H/ (Seek: Q) are then directly obtained using the naive 
interpretation (without torsion coefficients). One needs to pay attention in (14) to 
the precise definition of F, it is either the relative Frobenius Fr, or the Geometric 


234 A. Connes 


Frobenius Frg which is the inverse of the Arithmetic Frobenius Fr. The product 
Fr, o Fr, = Fr, o Fr, is the absolute Frobenius Fr which acts trivially on the £-adic 
cohomology. To understand the four different incarnations of “the Frobenius” it is 
best to make them explicit in the simplest example of the scheme Spec R where 
R= F, [T] is the ring of polynomials P(T) = }*a;T’, a; € F, 


* Geometric Frobenius: )> a;T/ > a; /9Ti 
¢ Relative Frobenius: P(T) +> P(T‘) 

¢ Absolute Frobenius: P(T) +> P(T)4 

* Arithmetic Frobenius: )* ajT/ > )- aT! 


The motivation of Grothendieck for developing etale cohomology came from the 
search of a Weil cohomology and the Weil conjectures which were solved by 
Deligne in 1973 [37]. 

In his quest Grothendieck uncovered several key concepts such as those of 
schemes and above all that of topos, see [2] and [78], in his own words: 


C’est le théme du topos, et non celui des schémas, qui est ce “lit”, ou cette “riviére 
profonde”’, ot: viennent s’épouser la géométrie et l’algébre, la topologie et |’ arithmétique, 
la logique mathématique et la théorie des catégories, le monde du continu et celui des 
structures “discontinues” ou “discrétes”. Si le theme des schémas est comme le ceur 
de la géométrie nouvelle, le theme du topos en est l’enveloppe, ou la demeure. Il est 
ce que j’ai concu de plus vaste, pour saisir avec finesse, par un méme langage riche en 
résonances géométriques, une “essence” commune 4 des situations des plus éloignées les 
unes des autres, provenant de telle région ou de telle autre du vaste univers des choses 
mathématiques. 


3 Riemannian Geometry, Spectra and Trace Formulas 


Riemannian Geometry gives a wealth of “spectra” of fundamental operators associ- 
ated to a geometric space, such as the Laplacian and the Dirac operators. 


3.1 The Selberg Trace Formula 


In the case of compact Riemann surfaces X with constant negative curvature —1, 
the Selberg trace formula [86], takes the following form where the eigenvalues of 
the Laplacian are written in the form’ 2, = —(; + 12). Let & > 0, h(r) be an 
analytic function in the strip |3(r)| < 5 + 6 and such that h(r) = h(—r) and with 
(1 + r°)'**|A(r)| being bounded. Then [60, 86, 87], with A the area of X, 


7Where the argument of r, is either 0 or —2/2. 
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Sohn) = < /  aehGEANO y- Wa went sloenm) (15) 
— {T} _— ° 


where g is the Fourier transform of h, i.e. more precisely g(s) = + bee h(r)e"*dr. 
The log N(T) are the lengths of the periodic orbits of the geodesic flow with 
log N(To) being the length of the primitive one. Already in 1950-51, Selberg saw 
the striking similarity of his formula with (3) which (cf. [60]) can be rewritten in the 
following form, with / and g as above and the non-trivial zeros of zeta expressed in 
the form p = 5 + iy, 


Sony) = ns) + n(—4) + x ij w(nh(rdr —25~ A(n)n~2 g(logn) (16) 
y —0o 


where 


cee Gee a ee Pie fa Vs, H(s) > 0 
od aa aa OR = eg y, Vs, R(s 


and A(n) is the von-Mangoldt function with value log p for powers p‘ of primes 
and zero otherwise. Moreover Selberg found that there is a zeta function which 
corresponds to (15) in the same way that ¢(s) corresponds to (16). The role of Hilbert 
space is crucial in the work of Selberg to ensure that the zeros of his zeta function 
satisfy the analogue of RH. This role of Hilbert space is implicit as well in RH which 
has been reformulated by Weil as the positivity of the functional W(g) defined as 
both sides of (16). More precisely the equivalent formulation is that W(g * g*) > 0 
on functions g which correspond to Fourier transforms of analytic functions h as 
above (i.e. even and analytic in a strip |Sz| < + + 5) where for even functions one 
has g*(s) := g(—s) = g(s). Moreover by [11, 15], it is enough, using Li’s criterion 
(cf. [11, 73]), to check the positivity on a small class of explicit real valued functions 
with compact support. In fact for later purposes it is better to write this criterion as 


RH <=> s(f.f) <0. Vf | : flu)d*u = / Flu)du = 0 (17) 


where for real compactly supported functions on R*., we let s(f,g) := N(f * 8) 
where « is the convolution product on R*, 2(u) := u—!g(u7'), and 


0° oo 2 _ 
N(h) = Y\ A@h(n) + / emg) 


1 
d*ut+tch(),c= 7 (logx +y) (18) 
1 


n=1 


The Selberg trace formula has been considerably extended by J. Arthur and plays 
a key role in the Langland’s program. We refer to [1] for an introduction to this vast 
topic. 
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3.2 The Minus Sign and Absorption Spectra 


The Selberg trace formula [86, 87] for Riemann surfaces of finite area, acquires 
additional terms which make it look e.g. in the case of X = H/PSL(2, Z) (where H 
is the upper half plane with the Poincaré metric) even more similar to the explicit 
formulas, since the parabolic terms now involve explicitly the sum 


2 ae A 202 logn) 


n=1 


Besides the square root in the A(7) terms in the explicit formulas (16) 


ay 


there is however a striking difference which is that these terms occur with a 
positive sign instead of the negative sign in (16), as discussed in [60, Sect. 12]. This 
discussion of the minus sign was extended to the case of the semiclassical limit of 
Hamiltonian systems in physics in [5, 6]. In order to get some intuition of what this 
reveals, it is relevant to go back to the origin of spectra in physics, i.e. to the very 
beginning of spectroscopy. It occurred when Joseph Von Fraunhofer (1787-1826) 
could identify, using self-designed instruments, about 500 dark lines in the light 
coming from the sun, decomposed using the dispersive power of a spectroscope 
such as a prism (cf. Fig. 1). These dark lines constitute the “absorption spectrum” 
and it took about 45 years before Kirchhoff and Bunsen noticed that several of 
these Fraunhofer lines coincide (i.e. have the same wave length) with the bright 
lines of the “emission” spectrum of heated elements, and showed that they could be 
reobtained by letting white light traverse a cold gas. In his work on the trace formula 
in the finite covolume case, Selberg had to take care of a superposed continuous 
spectrum due to the presence of the non-compact cusps of the Riemann surface. 


3.3. The Adele Class Space and the Explicit Formulas 


I had the chance to be invited at the Seattle meeting in 1996 for the celebration of 
the proof of the prime number theorem. The reason was the paper [12] (inspired 
from [69]) in which the Riemann zeta function appeared naturally as the partition 
function of a quantum mechanical system (BC system) exhibiting phase transitions. 
The RH had been at the center of discussions in the meeting and I knew the 
analogy between the BC-system and the set-up that V. Guillemin proposed in [56] 
to explain the Selberg trace formula using the action of the geodesic flow on the 
horocycle foliation. To a foliation is associated a von Neumann algebra [18], and 
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Continuous spectrum 


Absorption spectrum 


Fig. 1 The three kinds of spectra occurring in spectroscopy: (1) The top one is the “continuous 
spectrum” which occurs when white light is decomposed by passing through a prism. (2) The 
middle one is the “emission spectrum” which occurs when the light emitted by a heated gas is 
decomposed by passing through a prism and gives shining lines-a signature of the gas-over a 
dark background. (3) The third one is the “absorption spectrum” which occurs when white light 
traverses a cold gas and is then decomposed by passing through a prism. It appears as dark lines in 
a background continuous spectrum. The absorption lines occur at the same place as the emission 
lines 


the horocycle foliation on the sphere bundle of a compact Riemann surface gives a 
factor of type II, on which the geodesic flow acts by scaling the trace. An entirely 
similar situation comes canonically from the BC-system at critical temperature and 
after interpreting the dual system in terms of adeles, I was led by this analogy to 
consider the action of the idele class group of Q on the adele class space, i.e. the 
quotient Q*\Ag of the adeles Ag of Q by the action of Q*. I knew from the 
BC-system that the action of Q*, which preserves the additive Haar measure, is 
ergodic for this measure and gives the same factor of type IIbo as the horocycle 
foliation. Moreover the dual action scales the trace in the same manner. 

Let K be a global field and Ck = GL \(Ax)/K” the idele class group. The 
module Mod : Cx — R% being proper with cocompact range, one sees that the 
Haar measure on the Pontrjagin dual group of Cx is diffuse. Since a point is of 
measure 0 in a diffuse measure space there is no way one can see the absorption 
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spectrum without introducing some smoothness on this dual which is done using a 
Sobolev space if (Cx) of functions on Cx which (for fixed 5 > 1) is defined as 


Hg = i EQ) oe) dx, p(x) := (1 + log |x?) (19) 


Definition 1. Let K be a global field, the adele class space of K is the quotient 
Xx = Ax/K”% of the adeles of K by the action of K* by multiplication. 


We then consider the codimension 2 subspace .Y(Ax)o of the Bruhat-Schwartz 
space -Y(Ax) (cf. [13]) given by the conditions f(0) = 0, ff dx = 0 The Sobolev 
space L¥(Xix)o is the separated completion of .7(Ax)o for the norm with square 


iil? = [ 12 Flax)? pe) Ixlax (20) 


qeK* 


Note that by construction all functions of the form f(x) = g(x) — g(qx) for some 
q € K% belong to the radical of the norm (20), which corresponds to the operation 
of quotient of Definition 1. In particular the representation of ideles on ./(Ax) 
given by 


(B(a@)E)(x) = EC 'x) Va € GL) (Ag), x € Ax (21) 


induces a representation 7%, of Cx on Li (Xx)o- One has by construction a natural 
isometry € : Ls (Xx)o => E (Cx) which intertwines the representation 3, with the 
regular representation of Cx in Ls (Cx) multiplied by the square root of the module. 
This representation restricts to the cokernel of the map €, which splits as a direct 
sum of subspaces labeled by the characters of the compact group Cx; = Ker Mod 
and its spectrum in each sector gives the zeros of L-functions with Grdssencharakter. 
The shortcoming of this construction is in the artificial weight p(x), which is needed 
to see this absorption spectrum but only sees the zeros which are on the critical line 
and where the value of 6 artificially cuts the multiplicities of the zeros (cf. [19]). 

This state of affairs is greatly improved if one gives up trying to prove RH but 
retreats to an interpretation of the explicit formulas as a trace formula. One simply 
replaces the above Hilbert space set-up by a softer one involving nuclear spaces 
[80]. The spectral side now involves all non-trivial zeros and, using the preliminary 
results of [14, 19, 20] one gets that the geometric side is given by: 


=I 
Tra ( f iiw)9 o»)d"w) =o fo oe (22) 


We refer to [19, 22, 80] for a detailed treatment. The subgroups KX C Cx = 
GL; (Ax)/GLi(K) arise as isotropy groups. One can understand why the terms 
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h(w!) 
[1 — w 
of the scaling operator T = v,,-1 when working on the local field K, completion of 
the global field K at the place v, one has 


occur in the trace formula by computing, formally as follows, the trace 


TE®) = E(x) = / kx, EQ)dy 


so that T is given by the distribution kernel k(x, y) = 6(wx — y) and its trace is 


Traist(7) = [ren dx = | s0x-29 a0 = af 8 dz= ewes 


lw— 1 


When working at the level of adeles one treats all places on the same footing 
and thus there is an overall minus sign in front of the spectral contribution. Thus the 
Riemann spectrum appears naturally as an absorption spectrum from the adele class 
space. As such, it is difficult to show that it is “real”. While this solves the problem 
of giving a trace formula interpretation of the explicit formulas, there is of course 
still room for an interpretation as an emission spectrum. However from the adelic 
point of view it is unnatural to separate the contribution of the archimedean place. 


4 The Riemann-Roch Strategy: A Geometric Framework 


In this section we shall present a geometric framework which has emerged over the 
years in our joint work with C. Consani and seems suitable in order to transpose the 
geometric proof of Weil to the case of RH. The aim is to apply the Riemann-Roch 
strategy of Sect. 2.3. The geometry involved will be of elaborate nature inasmuch as 
it relies on the following three theories: 


1. Noncommutative Geometry. 
2. Grothendieck topoi. 
3. Tropical Geometry. 


4.1 The Limit q —> 1 and the Hasse-Weil Formula 


In [91, cf. Sect.6], C. Soulé, motivated by [79, cf. Sect. 1.5] and [38, 39, 70, 71, 
92, 95], introduced the zeta function of a variety X over F, using the polynomial 
counting function N(x) € Z|x] associated to X. The definition of the zeta function is 
as follows 


éx(s) = lim Z(X,q“)(q—-)"™, — s eR (23) 
qa 
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where Z(X,q *) denotes the evaluation at T = q™* of the Hasse-Weil exponential 
series 


ore 
Z(X,T) := exp NG) — (24) 


r>1 


For instance, for a projective space P” one has V(q) = 1+q+...+ q" and 


1 

Spm) (s) = lim (q — 1)"* "Senge, (8) = Te-) 

It is natural to wonder on the existence of a “curve” C suitably defined over 
F,, whose zeta function ¢c(s) is the complete Riemann zeta function €g(s) = 
cer (s/2)f(s) (cf. also [79]). The first step is to find a counting function N(q) 
defined for qg € [1,0o) and such that (23) gives €g(s). But there is an obvious 
difficulty since as N(1) represents the Euler characteristic one should expect that 
N(1) = —on (since the dimension of H' is infinite). This precludes the use of (23) 
and also seems to contradict the expectation that N(q) > 0 for g € (1,00). As 
shown in [23, 24] there is a simple way to solve the first difficulty by passing to the 
logarithmic derivatives of both terms in Eq. (23) and observing that the Riemann 
sums of an integral appear from the right hand side. One then gets instead of (23) 
the equation: 


dsEn(s) _ 
én (5) 


Thus the integral equation (25) produces a precise equation for the counting function 
Nc(q) = N(q) associated to C: 


Asbols) _ 
Cals) 


- [> vw u*d*u (25) 
1 


- [ N(u)u*d*u (26) 
1 


One finds that this equation admits a solution which is a distribution and is given 
with g(u) := >°._,, n A(n), by the equality 


d 
N(u) = ra + k(u) (27) 


where x (u) is the distribution which appears in the explicit formula (3), 


u 


oo co 2 ne 
/ K(u)f(u)d*u = / PO) LO tu + ofl), c= F(logx +) 
1 1 -_ 
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The conclusion is that the distribution N(u) is positive on (1, co) and is given by 


yet! 


p+1 


N(u) =u— < ) * order(p) +1 (28) 


pEZ 
where the derivative is taken in the sense of distributions, and the value at u = 1 of 


yet! 1 / 
= eo) y 1 log4x  &(-)) 
the term w(u) = y oe a is given by 5 + 5 + S— Fat) * 
peEZ 


The primitive J(u) = we — o(u) + u of N(u) is an increasing function on (1, co), 
but tends to —co when u — 1+ while its value J(1) is finite (Fig. 2). The tension 
between the positivity of the distribution N(q) for g > 1 and the expectation that 
its value N(1) should be N(1) = —oo is resolved by the theory of distributions: N 
is finite as a distribution, but when one looks at it as a function its value at g = 1 
is formally given by 


1 -—oa(l 1 1 
om t+ 9) od) ElogE, ae 
=>0 € 2 € 


N() =2-1 


120 


100 


80 J(u) = fi Nw) du 
60 
40 


20 


2 4 6 8 10 12 14 


Fig. 2 This represents a function J(u) which is a primitive of the counting distribution N(u). 
This function is increasing and tends to —oo when u — 1. The wiggly graph represents the 
approximation of J(u) obtained using the symmetric set Z,, of the first 2m zeros, by 


uP 


+1 
Pes iis 


2 
Jin (u) = 7 > y order(p) 
Z, 


am. 


Note that J(u) —> —oo when u > 1+ 


242 A. Connes 


which is —oo and in fact reflects, when € — 0, the density of the zeros. Note that this 
holds independently of the choice of the principal value in the explicit formulas. This 
subtlety does not occur for function fields K since their module Mod(K) is discrete 
so that distributions and functions are the same thing. There is one more crucial 
nuance between the case K = Q and the function fields: the distribution «(u) which 
is the archimedean contribution to N(u) in (27), does not fulfill the natural inequality 
N(q) < N(q‘) expected of a counting function. This is due to the terms |1 — u|~! in 
the Weil explicit formula, which as explained in Sect. 2.1.3 contribute non-trivially 
at the archimedean place, and indicate that the counting needs to take into account 
an ambient larger space and transversality factors as in [56]. In fact, we have seen in 
Sect. 3.3 that the noncommutative space of adele classes of a global field provides a 
framework to interpret the explicit formulas of Riemann-Weil in number theory as 
a trace formula, and that the geometric contributions give the right answer. In [24], 
we showed that the quotient 


Xq := Q*\Ag/Z”* (29) 


of the adele class space Q*\Ag of the rational numbers by the maximal compact 
subgroup Z* of the idele class group, gives by considering the induced action of 
R*, the above counting distribution N(u), u € [1, 00), which determines, using the 
Hasse-Weil formula in the limit g — 1, the complete Riemann zeta function. The 
next step is to understand that the action of R% on the space Xq is in fact the action 
of the Frobenius automorphisms Fr, on the points of the arithmetic site—an object 
of algebraic geometry—over IR{*. To explain this we first need to take an excursion 
in the exotic world of “characteristic one”. 


4.2 The World of Characteristic 1 


The key words here are: Newton polygons, Thermodynamics, Legendre transform, 
Game theory, Optimization, Dequantization, Tropical geometry. One alters the basic 
operation of addition of positive real numbers, replacing x+y by xV y := max(x, y). 
When endowed with this operation as addition and with the usual multiplication, the 
positive real numbers become a semifield R%**. It is of characteristic 1, i.e. 1V1 = 1 
and contains the smallest semifield of characteristic 1, namely the Boolean semifield 
B = {0, 1}. Moreover, R'?** admits non-trivial automorphisms and one has 


Galg(R3™) := Autg(R™) = R4, Fr() =2x7, Vx e RR, Ae RY 


thus providing a first glimpse of an answer to Weil’s query in [100] of an algebraic 
framework in which the connected component of the idele class group would 
appear as a Galois group. More generally, for any abelian ordered group H we let 
Hmax = H U {—0o} be the semifield obtained from H by the max-plus construction, 


An Essay on the Riemann Hypothesis 243 


i.e. the addition is given by the max, and the multiplication by +. In particular Ryax 
is isomorphic to R?“ by the exponential map (cf. [49]). Historically, and besides 
the uses of Rmax in idempotent analysis and tropical geometry which are discussed 
below, an early use of Ryyax occurred in the late fifties in the work of R. Cuninghame- 
Green in Birmingham, who established the spectral theory of irreducible matrices 
with entries in Ryax (cf. [34]) and in the sixties, in Leningrad, where Vorobyev 
used the Rmax formalism in his work motivated by combinatorial optimization, 
and proved a fundamental covering theorem. A systematic use of the Rmax algebra 
was developed by the INRIA group at the beginning of the 80’s in their work on 
the modelization of discrete event systems [17]. We refer to [49, 50] for a more 
detailed history of the subject, and for overwhelming evidence of its relevance in 
mathematics. We shall just give here a sample of this evidence starting by a really 
early occurrence in the work of C.G.J. Jacobi® and hoping to convince the reader 
that it would be a mistake to dismiss this algebraic formalism and the analogy with 
ordinary algebra as trivial. 


4.2.1 Optimization, Jacobi 


One of the early instances, around 1840, of the use of matrices over Riax is the work 
of Jacobi [67] on optimal assignment problems, where he states 


: Problema 
Disponantur nn quantitates AY quaecunque in schema Quadrati, ita ut habeantur n series 
horizontales et n series verticales, quarum quaeque est n terminorum. Ex illis quantitatibus 
eligantur n transversales, i.e. in seriebus horizontalibus simul atque verticalibus diversis 
positae, quod fieri potest n! modis; ex omnibus illis modis quaerendus est is, qui summam 
n numerorum electorum suppeditet maximam. 


In other words, given a square matrix mz = ne? he looks for the maximum over 
all permutations o of the quantity }° mjo,j). Using the algebraic rules of Rmax one 
checks that he is in fact computing the analogue of the determinant for the matrix 
mix. In fact the perfect definition of the determinant is more subtle and was obtained 
in the work of Gondran-Minoux [53], instead of max )* Mjo(j) Where o runs over all 
permutations, one uses the signature of permutations and considers the pair 


(det4 (mj), det_(mjx)), dets (mix) = max 3 Mio (j) 


sign(o)=-— 


The remarkable fact is that the Cayley-Hamilton theorem now holds, as the equality 
of two terms P4(m) = P_(m) corresponding to the characteristic polynomial 
P = (Px, P_). Each of the terms P+(m) € M,(Rmax) is computed from the original 
matrix m € M,(Rmax) using the rules of matrices with entries in Ryax which turn 
M,,(Rmax) into a semiring. 


5] am grateful to S. Gaubert for pointing out this early occurrence. 
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4.2.2 Idempotent Analysis 


The essence of the theory of semiclassical analysis in physics rests in the compari- 
son of quantum systems with their semiclassical counterpart, [5, 46, 55, 57, 58]. In 
the eighties V. P. Maslov and his collaborators developed a satisfactory algebraic 
framework which encodes the semiclassical limit of quantum mechanics. They 
called it idempotent analysis. We refer to [72, 74] for a detailed account and just 
mention briefly some salient features here. The source of the variational formula- 
tions of mechanics in the classical limit is the behavior of sums of exponentials 


Sj inf Sj 


) ehk~e k , whenh—-O 


which are, when # — 0, dominated by the contribution of the minimum of S. The 
starting observation is that one can encode this fundamental principle by simply 
conjugating the addition of numbers by the power operation x +> x* and passing to 
the limit when € — 0. The new addition of positive real numbers is 


€ 
lim (x + y*) = max{x,y}=xVy 
e>0 


and one recovers R®*“ as the natural home for semiclassical analysis. The super- 
position principle of quantum mechanics, i.e. addition of vectors in Hilbert space, 
now makes sense in the limit and moreover the “fixed point argument” proof of 
the Perron-Frobenius theorem works over R%** and shows that irreducible compact 
operators have one and only one eigenvalue,” thus reconciling classical determinism 
with the quantum variability. But the most striking discovery of this school of 
Maslov, Kolokolstov and Litvinov [72, 74] is that the Legendre transform which 
plays a fundamental role in all of physics and in particular in thermodynamics in the 
nineteenth century, is simply the Fourier transform in the framework of idempotent 
analysis! 

The contact between the INRIA school and the Maslov school was established 
in 92 when Maslov was invited in the Seminar of Jacques Louis Lions in College 
de France. At the BRIMS HP-Labs workshop on Idempotency in Bristol (1994) 
organized by J. Gunawardena, several of the early groups of researchers in the field 
were there, and an animated discussion took place on how the field should be named. 
The names max-plus, exotic, tropical, idempotent were considered, each one having 
its defaults. 


°As mentioned above, this result was obtained already for matrices in 1962 by R. Cuninghame- 
Green. 
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4.2.3 Tropical Geometry, Riemann-Roch Theorems 
and the Chip Firing Game 


The tropical semiring Nmin = N U {oo} with the operations min and + was 
introduced by Imre Simon in [90] to solve a decidability problem in rational 
language theory. His work is at the origin of the term “tropical” coined by Marco 
schutzenberger and used in tropical geometry which is a vast subject, see e.g. 
[45, 51, 77, 81]. We refer to [96] for an excellent introduction starting from the 
sixteenth Hilbert problem. In its simplest form (cf. [48]) a tropical curve is given 
by a metric graph I" (i.e. a graph with a usual line metric on its edges). The 
natural structure sheaf on I” is the sheaf @ of real valued functions which are 
continuous, convex, piecewise affine with integral slopes. The operations on such 
functions are given by the pointwise operations of Ryax-valued functions, i.e. 
(f V g)(x) = f(@) V g(x) for all x € and similar for the product which is given by 
pointwise addition. One also adjoins the constant —oo which plays the role of the 
zero element in the semirings of sections. One proceeds as in the classical case with 
the construction of the sheaf .#% of semifields of quotients and finds the same type of 
functions as above but no longer convex. Cartier divisors make sense and one finds 
that the order of a section f of .% at a point x € I” is given by the sum of the (integer 
valued) outgoing slopes. The conceptual explanation of why the discontinuities of 
the derivative should be interpreted as zeros or poles is due to Viro, [97] who showed 
that it follows automatically if one understands that!® the sum x V x of two equal 
terms in Ryax Should be viewed as ambiguous with all values in the interval [—oo, x] 
on equal footing. In their work Baker and Norine [3] proved in the discrete set-up 
of graphs (where g is the genus and K the canonical divisor) the Riemann-Roch 
equality in the form 


r(D) — r(K — D) = Deg(D) — g + 1 (30) 


where by definition r(D) := max{k | H°(D — t) 4 {-co}, Vr => 0, Deg(t) = k} 
and H°(D) is the Ryyax-module of global sections f of the associated sheaf @p i.e. 
sections of .# such that D + (f) => 0. The essence of the proof of [3] is that the 
inequality Deg(D) > g for a divisor implies H°(D) # {—oo}. Once translated in 
the language of the chip firing game (op.cit.), this fact is equivalent to the existence 
of a winning strategy if one assumes that the total sum of dollars attributed to the 
vertices of the graph is > g where g is the genus. We refer to [48, 82] for variants of 
the above Riemann-Roch theorem, and to [8, 40, 84] for early occurrences of these 
ideas in a different context (including sandpile models and parking functions!). 


'0s seen when using Rmax as the target of a valuation. 
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4.3 The Arithmetic and Scaling Sites 
4.3.1 The Arithmetic Site and Frobenius Correspondences 


The arithmetic site [30, 31] is an object of algebraic geometry involving two elabo- 
rate mathematical concepts: the notion of topos and of (structures of) characteristic 
1 in algebra. A nice fact (cf. [52]) in characteristic 1 is that, provided the semiring 
R is multiplicatively cancellative (i.e. equivalently if it injects in its semifield of 
fractions) the map x t» x” = Fr,(x) is, for any integer n € N*, an injective 
endomorphism Fr, of R. One thus obtains a canonical action of the semigroup N* 
on any such R and it is thus natural to work in the topos N™ of sets endowed with an 
action of N*. 


Definition 2. The arithmetic site 27 = (N*, Zmax) is the topos N* endowed with 
the structure sheaf C := Zmax Viewed as a semiring in the topos using the action of 
N%* by the Frobenius endomorphisms. 


The topological space underlying the arithmetic site is the Grothendieck topos of 
sets endowed with an action of the multiplicative monoid N* of non-zero positive 
integers. As we have seen above the semifield IR" of tropical real numbers admits a 
one parameter group of Frobenius automorphisms Fr,, A € R%, given by Fr) (x) = 
x YxeE IR}. Using a straightforward extension in the context of semi-ringed topos 
of the classical notion of algebraic geometry of a point over a ring, one then gets the 
following result which gives the bridge between the noncommutative geometry and 
topos points of view: 


Theorem 1 ((30, 31]). The set of points of the arithmetic site < over RY is 


canonically isomorphic with Xg = Q*\AQ/Z*. The action of the Frobenius 
automorphisms Fr, of IR“ on these points corresponds to the action of the idele 


class group on Xg = Q* \Ag/Z*. 


The square of the arithmetic site is the topos N** endowed with the structure 
sheaf defined globally by the multiplicatively cancellative semiring associated to 
the tensor square Zin ®B Zmin Over the smallest Boolean semifield of characteristic 
one. In this way one obtains the semiring whose elements are Newton polygons and 
whose operations are given by the convex hull of the union and the sum. The points 
of the square of the arithmetic site over IR‘'“ coincide with the product of the points 
of the arithmetic site over IR‘. Then, we describe the Frobenius correspondences 
W(A) as congruences on the square parametrized by positive real numbers A € R?.. 


The remarkable fact at this point is that while the arithmetic site is constructed 
as a combinatorial object of countable nature it possesses nonetheless a 
one parameter semigroup of “correspondences” which can be viewed as 
congruences in the square of the site. 
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In the context of semirings, the congruences i.e. the equivalence relations 
compatible with addition and product, play the role of the ideals in ring theory. 
The Frobenius correspondences W(A), for a rational value of A, are deduced from 
the diagonal of the square, which is described by the product structure of the 
semiring, by composition with the Frobenius endomorphisms. We interpret these 
correspondences geometrically, in terms of the congruence relation on Newton 
polygons corresponding to their belonging to the same half planes with rational 
slope A. These congruences continue to make sense also for irrational values of A 
and are described using the best rational approximations of A, while different values 
of the parameter give rise to distinct congruences. The composition of the Frobenius 
correspondences is given for A, A’ € IR% such that AA’ ¢ Q by the rule [30, 31] 


WA)oW(A’) = WAN’) (31) 


The same equality still holds if A and A’ are rational numbers. When 1,1’ are 
irrational and AA’ € Q one has 


W(A)o WA’) = Id. 0 WAN’) (32) 


where Id, is the tangential deformation of the identity correspondence. 


4.3.2 The Scaling Site and Riemann-Roch Theorems 


The Scaling Site A , [33], is the algebraic geometric space obtained from the 
arithmetic site <7 of [30, 31] by extension of scalars from the Boolean semifield 
to the tropical semifield R?**. The points of .e/ are the same as the points .o/ (IR{'*) 
of the arithmetic site over Ri“. But of inherits from its structural sheaf a natural 
structure of tropical curve, in a generalized sense, allowing one to define the sheaf 
of rational functions and to investigate an adequate version of the Riemann-Roch 
theorem in characteristic 1. In [33], we tested this structure by restricting it to the 
periodic orbits of the scaling flow, i.e. the points over the image of Spec Z under the 
canonical morphism of toposes © : SpecZ — & (cf. [31, Sect.5.1]). We found 
that for each prime p the corresponding circle of length logp is endowed with a 
quasi-tropical structure which turns this orbit into the analogue C, = R4_/ p” of 
a classical elliptic curve C*/q”. In particular rational functions, divisors, etc all 
make sense. A new feature is that the degree of a divisor can now be any real 
number. The Jacobian of C, (i.e. the quotient J(C,) of the group of divisors of 
degree 0 by principal divisors) is a cyclic group of order p — 1. For each divisor D 
there is a corresponding Riemann-Roch problem with solution space H°(D) and the 
continuous dimension Dimg(H°(D)) of this Rmax-module is defined as the limit 


Dimg(H°(D)) := lim. p~"dimyop(H° (D)”") (33) 
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where H°(D)?" is a natural filtration and dimiop(&) is the topological dimension of 
an Rmax-module &. One has the following Riemann-Roch formula [33], 


Theorem 2. (i) Let D € Div(C,) be a divisor with deg(D) = 0. Then the limit 
in (33) converges and one has Dimg(H°(D)) = deg(D). 
(ii) The following Riemann-Roch formula holds 


Dimg(H°(D)) — Dimg(H°(—D)) = deg(D), WD € Div(C,) 


The appearance of arbitrary positive real numbers as continuous dimensions in 
the Riemann-Roch formula is due to the density in R of the subgroup H, C Q 
of fractions with denominators a power of p. This outcome is the analogue in 
characteristic 1 of what happens for modules over matroid C*-algebras and the type 
II normalized dimensions as in [41]. 


At this point, what is missing is an intersection theory and a Riemann-Roch 
theorem on the square of the arithmetic site. One expects that the right hand 
side of the Riemann-Roch formula will be of the form 5D.D = s(f,f) when 
the divisor D is of the form 


Dif) = / Wayfayd"a 


Here f(A) is a real valued function with compact support of the variable A € R*, 
and s(f,f) is as in (17). More precisely D.D should be obtained as the intersection 
number of D o D (defined using composition of correspondences) with the diagonal 
A and hence as a suitably defined distributional trace as for the counting function 
N(u) of Sect. 4.1 so that $D(f).D(f) = s(f,f) with the notations of (17). So far 
the Riemann-Roch formula in tropical geometry is limited to curves and there is 
no Serre duality or good cohomological version of H/ for j # 0, but in the above 
context one can hope that a Riemann-Roch inequality of the type (12), i.e. of the 
form 


Dimg(H°(D)) + Dimg(H°(—D)) > 5DD 


would suffice to apply the strategy of Sect.2.3 to prove the key inequality (17) 
(Table 1). 
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Table 1 Here are a few entries in the analogy 


C curve over F, Arithmetic site of = (NX, Zimax) over B 
Structure sheaf Gc Structure sheaf Zax 

C=C @r, F, Scaling site < = ([0, 00) 1 N*, @) over Re 
CH) =CF,) ARE) = ARE) 

Galois action on C(F,) Galois action on &/(R4™) 

Structure sheaf 0é Structure sheaf G = Zmax@pRi” 

of C= C @r, Fy piecewise affine convex functions, integral slopes 
Sheaf .% of rational functions C Sheaf .#% of piecewise affine continuous functions 
onC =C @r, F, with integral slopes 

Cartier divisors = sections of /0* | Sections of #/0* 

X=CxC AXA 

D=Yvaw D= [ W(A)f(A)d*A 

Frobenius correspondence Y correspondences W(A) 


5 Absolute Algebra and the Sphere Spectrum 


Even if the Riemann-Roch strategy of Sect. 4 happened to be successful, one should 
not view the arithmetic and scaling sites for more than what they are, namely a 
semiclassical shadow of a still mysterious structure dealing with compactifications 
of Spec Z. An essential role in the unveiling of this structure should be played, for 
the reasons briefly explained below, by the discovery made by algebraic topologists 
in the 80’s (see [42]) that in their world of “spectra” (in their sense) the sphere spec- 
trum is a generalized ring S which is more fundamental than the ring Z of integers, 
while the latter becomes an S-algebra. Over the years the technical complications 
of dealing with spaces “up to homotopy” have greatly been simplified, in particular 
for the smash product of spectra. For the purpose of arithmetic applications, Segal’s 
I’-rings provide a very simple algebraic framework which succeeds to unify several 
attempts pursued in recent times in order to define the meaning of “absolute 
algebra”. In particular it contains the following three possible categories that had 
been considered previously to handle this unification: namely the category .@ of 
monoids as in [24, 26, 35, 36], the category # of hyperrings of [25, 27, 28] and 
finally the category .Y of semirings as in [21, 30, 31, 33]. Thanks to the work of 
L. Hesselholt and I. Madsen briefly explained below in Sect. 5.2 one now has at 
disposal a candidate cohomology theory in the arithmetic context: topological cyclic 
homology. 
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5.1 Segal’s I’ -Rings 


Let °°? be the small, full subcategory of the category of finite pointed sets whose 
objects are the pointed finite sets!! ky := {0,...,k}, for k > 0. The object 0+ 
is both initial and final so that "°? is a pointed category. The notion of a discrete 
I"-space, i.e. of a I”-set is as follows: 


Definition 3. A I"-set F is a functor F : '°? —> Gets, between pointed categories 
from I”°? to the category of pointed sets. 


The morphisms Hompop (M, N) between two I’-sets are natural transformations of 
functors. The category Gets, of [’-sets is a symmetric closed monoidal category 
(cf. [42, Chap. II]). The monoidal structure is given by the smash product (denoted 
X A Y) of I’-sets which is a Day product. The closed structure property is shown 
in [76] (cf. also [42, Theorem 2.1.2.4]). The specialization of Definition 2.1.4.1. of 
[42] to the case of I-sets yields the following 


Definition 4. A I-ring & is a I'-set w : T°? —> Gets, endowed with an 
associative multiplication W : SA SG > A andaunit 1: S > &, where 
S: 1°? —>+ Gets, is the inclusion functor. 


Thus [’-rings'* make sense and the sphere spectrum corresponds to the simplest 
possible J”-ring: S. One can then easily identify the category Gets, of I-sets with 
the category Ntoo(S) of S-modules. In [43], N. Durov developed a geometry over 
F, intended for Arakelov theory applications by using monads as generalizations of 
classical rings. While in the context of [43] the tensor product Z ®@g, Z produces an 
uninteresting output isomorphic to Z, we showed in [32] that the same tensor square, 
re-understood in the theory of S-algebras, provides a highly non-trivial object. The 
Arakelov compactification of Spec Z is endowed naturally with a structure sheaf 
of S-algebras and each Arakelov divisor provides a natural sheaf of modules over 
the structure sheaf. This new structure of Spec Z over S endorses a one parameter 
group of weakly invertible sheaves whose tensor product rules are the same as the 
composition rules (31), (32) of the Frobenius correspondences over the arithmetic 
site [30, 31]. The category 9!tod0(S) of S-modules is not an abelian category and thus 
the tools of homological algebra need to be replaced along the line of the Dold-Kan 
correspondence, which for an abelian category & gives the correspondence between 
chain complexes in > 0 degrees and simplicial objects i.e. objects of 4”. 


'lWhere 0 is the base point. 
'quivalently S-algebras. 
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Table 2 Short dictionary X € Chso(@) M € Mtoa(S)4” 
homology—homoto = 
ad . H,(X) 4(M) 
A,(f) : Hy(X) ~ Ay(Y) | mq(f) : 1q(M) = 1,(N) 
quasi-isomorphism weak equivalence 
th : Xn Ss Y; Cofibration 
+ projective cokernel (stable) 
Sh : Xn =e Yn Stable 
surjective if n > 0 fibration 


At this point one has the following simple but very important observation 
that I"-spaces should be viewed as simplicial objects in Gets, = SJtod(S), 
so that homotopy theory should be considered as the homological algebra 
corresponding to the “absolute algebra” taking place over S. 


We refer to Table 2 for a short dictionary. The category of I”-spaces is the central 
tool of [42], while the relations between algebraic K-theory and topological cyclic 
homology is the main topic. 


5.2 Topological Cyclic Homology 


As shown in [32] the various attempts done in recent times to develop “absolute 
algebra” are all unified by means of the well established concept of S-algebra, i.e. 
of I’-rings. Moreover (cf. [42]) this latter notion is at the root of the theory of 
topological cyclic homology which can be understood as cyclic homology over 
the absolute base S, provided one uses the appropriate Quillen model category. 
In particular, topological cyclic homology is now available to understand the 
new structure of Spec Z using its structure sheaf and modules. The use of cyclic 
homology in the arithmetic context is backed up by the following two results: 


e At the archimedean places, and after the initial work of Deninger [38, 39] to 
recast the archimedean local factors of arithmetic varieties [89] as regularized 
determinants, we showed in [29] that cyclic homology in fact gives the correct 
infinite dimensional (co)homological theory for arithmetic varieties. The key 
operator © in this context is the generator of the A-operations A(k) [75, 98, 99] 
in cyclic theory. More precisely, the action u° of the multiplicative group Ry 
generated by © on cyclic homology, is uniquely determined by its restriction to 
the dense subgroup Q% C Ri. where it is given by the formula 


k?luc, = AK", Yn >0, KE N* CR¥ (34) 
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Let X be a smooth, projective variety of dimension d over an algebraic number 
field K and let vjoo be an archimedean place of K. Then, the action of the 
operator © on the archimedean cyclic homology HC™ (cf. [29]) of X,, satisfies 


w deta a c= (9) ar 
I] Ly( W(X), s)-D at (Cea Yacex)) (35) 


a 1 
0<w<2d dete (5=(s — O)| Hex (x,)) 


The left-hand side of (35) is the product of Serre’s archimedean local factors of 
the complex L-function of X (cf.[89]). On the right-hand side, det, denotes the 
regularized determinant and one sets 


HC2(X%)) = QB ACK(X,), ACK) = CG ACr(xX,) 


n=2k>0 n=2k+1>1 


e L. Hesselholt and I. Madsen have shown (cf. e.g. [61-63]) that the de Rham-Witt 
complex, an essential ingredient of crystalline cohomology (cf. [7, 65]), arises 
naturally when one studies the topological cyclic homology of smooth algebras 
over a perfect field of finite characteristic. One of the remarkable features in their 
work is that the arithmetic ingredients such as the Frobenius and restriction maps 
are naturally present in the framework of topological cyclic homology. Moreover 
L. Hesselholt has shown [64] how topological periodic cyclic homology with its 
inverse Frobenius operator may be used to give a cohomological interpretation 
of the Hasse-Weil zeta function of a scheme smooth and proper over a finite field 
in the form (cf. [64]): 


detys(5= (8 — O)|1P.4(x)) 
detoo (+ (s ~ QO) lTP.(X)) 


O(X,s) = (36) 


The similarity between (35) and (36) (applied to a place of good reduction) 
suggests the existence of a global formula for the L-functions of arithmetic 
varieties, involving cyclic homology of S-algebras, and of a Lefschetz formula 
in which the local factors appear from the periodic orbits of the action of R4.. 


One of the stumbling blocks in order to reach a satisfactory cohomology theory 
is the problem of coefficients. Indeed, the natural coefficients at a prime p for 
crystalline cohomology are an extension of Q, and it is traditional to relate them 
with complex numbers by an embedding of fields. Similarly, (36) uses an embedding 
of the Witt ring W(F,) — C. To an analyst it is clear that since such embeddings 
cannot be measurable!’ they will never be effectively constructed. This begs for 
a better construction, along the lines of Quillen’s computation of the algebraic 


'34 measurable group homomorphism from Ze to C* cannot be injective. 
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K-theory of finite fields, which instead would only involve the ingredient of the 
Brauer lifting, i.e. a group injection of the multiplicative group of F, as roots of 
unity in C. 


5.3 Final Remarks 


The Riemann hypothesis has been extended far beyond its original formulation 
to the question of localization of the zeros of L-functions. There are a number of 
constructions of L-functions coming from three different sources, Galois represen- 
tations, automorphic forms and arithmetic varieties. André Weil liked to compare 
(cf. [10, Sect. 12] and also [104, vol. 1, p. 244-255 and vol. 2, p. 408-412]), the 
puzzle of these three different writings to the task of deciphering hieroglyphics with 
the help of the Rosetta Stone. In some sense the L-functions play a role in modern 
mathematics similar to the role of polynomials in ancient mathematics, while the 
explicit formulas play the role of the expression of the symmetric functions of 
the roots in terms of the coefficients of the polynomial. If one follows this line of 
thought, the RH should be seen only as a first step since in the case of polynomials 
there is no way one should feel to have understood the zeros once one proves that 
they are, say, real numbers. In fact Galois formulated precisely the problem as that of 
finding all numerical relations between the roots of an equation, with the trivial ones 
being given by the symmetric functions, while the others, when determined, will 
reveal a complete understanding of the zeros as obtained, in the case of polynomials, 
by Galois theory. In a fragment, page 103, of the complete works of Galois [47] 
concerning the memoir of February 1830, he delivers the essence of his theory: 


Remarquons que tout ce qu’une équation numérique peut avoir de particulier, doit provenir 
de certaines relations entre les racines. Ces relations seront rationnelles c’est-a-dire qu’ elles 
ne contiendront d’irrationnelles que les coefficients de |’équation et les quantités adjointes. 
De plus ces relations ne devront pas étre invariables par toute substitution opérée sur les 
racines, sans quoi on n’aurait rien de plus que dans les équations littérales. Ce qu’ il importe 
donc de connaitre, c’est par quelles substitutions peuvent étre invariables des relations entre 
les racines, ou ce qui revient au méme, des fonctions des racines dont la valeur numérique 
est déterminable rationnellement.'* 


Acknowledgements I am grateful to J. B. Bost for the reference [94], to J. B. Bost, P. Cartier, C. 
Consani, D. Goss, H. Moscovici, M. Th. Rassias, C. Skau and W. van Suijlekom for their detailed 
comments, to S. Gaubert for his help in Sect. 4.2 and to Lars Hesselholt for his comments and for 
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'4Tn 2012 I had to give, in the French academy of Sciences, the talk devoted to the 200th anniversary 
of the birth of Evariste Galois. On that occasion I read for the n+ Ith time the book of his collected 
works and was struck by the pertinence of the above quote in the analogy with L-functions. In the 
case of function fields one is dealing with Weil numbers and one knows a lot on their Galois theory 
using results such as those of Honda and Tate cf. [94]. 
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Navier Stokes Equations: A Quick Reminder 
and a Few Remarks 


Peter Constantin 


Abstract We describe briefly some mathematical problems related to the Navier- 
Stokes and Euler equations. 


1 Introduction 


The incompressible Navier-Stokes equations model viscous Newtonian fluids and 
can be viewed as an expression of Newton’s second law (mass times acceleration 
equals force). In the simplest case, taking a fluid of unit density and considering 
only forces resulting from interior friction and incompressibility, the equations take 
the form 


0u+u-Vu—vAu+ Vp = 0. (1) 


Here u = u(x, f) is the velocity of the fluid, u(x,t) € IR? computed at the point 
x € R@ and time t > 0, 0,u is the time partial derivative and u - V is the first order 
differential operator u- V = i uj(x, t)d,;. The Laplacian of u, Au, is multiplied 
by the kinematic viscosity v, a positive constant. The gradient of the pressure p(x, 1), 
Vp completes the equations. On components, the equations are thus 


O,Uj + uj0jUi + dip = vAu; = 0, i= 1, 2. Peers (2) 


(We use the summation convention: repeated indices are summed.) The incompress- 
ibility is the constraint 


OjUi; = 0. (3) 
The constraint (3) results in an equation for the pressure. We will say more about this 


equation in the section about the pressure, but we mention already that the pressure 
responds instantaneously to far away disturbances: it is a non-local term. 
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The equation is usually studied in d = 2 or d = 3, and x may be restricted 
to belong to some domain Q C R%, in which case the most common boundary 
conditions are homogeneous Dirichlet 


wag = 0. (4) 


In the case Q = R? these are replaced by decay assumptions at spatial infinity, 
expressed by requiring u to belong to an appropriate class of functions. There is 
also a version in which R¢ is replaced by the torus T“, that is the functions u and p 
are required to be spatially periodic with the same period. 

When the kinematic viscosity is set equal to zero the equations are called the 
incompressible Euler equations 


djutu-VutVp=0, V-u=0. (5) 


It is not possible to review even in passing the vast literature on Navier-Stokes and 
Euler equations. In this paper I touch on very few selected topics related to regularity 
of solutions and the inviscid limit. The relationship between the Euler equations [12] 
and the Navier-Stokes equations is at the core of both subjects. 


2 Vorticity 


A point of view on the subject centers on the vorticity, the antisymmetric part of the 
gradient of velocity, (Vu) — (Vu)*. In three dimensions this can be written as the 
vector valued function 


o=Vxu, (6) 
and in two dimensions the vorticity can be viewed as a real scalar valued function, 
oO= Oj uy — Oou4. (7) 


If we think of two dimensional flow as depending only on the variables x;,x2 in 
three dimensional space with coordinates (x;, x2,x3), and with zero velocity in the 
third direction, then the vorticity is a vector whose direction is perpendicular to the 
plane of (x), x2, 0) and whose signed magnitude is given by (7). The Navier-Stokes 
equations can be written in terms of the vorticity 


d@+u-Vo-—vAw=a-Vu (8) 
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and inverting (6) closes the system. The pressure disappeared. The nonlocality did 
not: the velocity depends on the vorticity in a non-local fashion. In two dimensions 
the vorticity equation is 


00 +u-Vo—vAw = 0 (9) 


and the difference between two and three dimensions appears clearly, because (9) 
has important additional properties that are absent in (8). The right-hand side of (8) 
is called the “stretching term” and is absent in (9). Consequently, in two dimensions 
the vorticity has controlled magnitude: in the Euler equations, the distribution 
function of vorticity is conserved, and hence all LZ? norms do not change in time; 
in the Navier-Stokes equations these norms are non-increasing functions of time. 
The question of regularity of the Navier-Stokes and Euler equations can be decided 
in terms of vorticity alone, and in two dimensions the decision is favorable. No 
singularities can be formed from smooth beginnings. This happens not because of 
some scaling subcriticality, but because of conservation laws in the Euler equations. 
The conservation of magnitude of vorticity on particle paths in the 2D Euler 
equations is a reflection of the coherence of the vorticity direction field. It is known 
since Leray [27] that the 3D Navier-Stokes equations have weak solutions whose 
gradients are square-summable in space-time. The Leray weak solutions retain 
the basic information that follows from the energy balance of the Navier-Stokes 
equations, namely that the kinetic energy is bounded in time, 


u€ L©(0,T; L?(R°)) 
and that the rate of dissipation of energy is square integrable in time 
u € L°(0,T; H'(R?)). 


The latter is equivalent with the statement that 


T 
w||?.dt < oo. 
: loll: 


On the other hand, if a solution satisfies 


T 
/ llo||t.dt <0 
0 


then no singularities can form from smooth initial data on the time interval [0, 7]. 
This fact can be easily proved: first, the assumed bound implies directly from the 
evolution equation of vorticity a bound that gives 


w € L®(0,T; L’(R°)), 
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together with 
wo € L’(0,T; W'?(R°)). 


These bounds signify that the solution is a “strong” solution. Once being a strong 
solution is established, then functional calculus inequalities can be used to control 
higher derivatives. 

The Leray solutions permit the definition of basic turbulence quantities, the energy 
dissipation rate €, and the Kolmogorov length scale x. The Kolmogorov scale is 
the only quantity with units of length that can be formed with the energy dissipation 
and viscosity, 


where 
€ = v(|Vul’) 


is the energy dissipation rate per unit mass. It is believed that this length is the typical 
length below which viscosity dominates. It is maybe appropriate here to discuss 
dimensions and scaling. The velocity has dimensions of [u] = LT~', the kinematic 
viscosity [v] = L’?T~!. The energy per unit mass has then units L*7~? and its rate 
of dissipation (time derivative) has units [e] = L?7~?. Solving (as it were) for L 
knowing [e] and [v] yields 0x. The scaling invariance of the equations means that 
if we changext> y= F,tres= 7 pep= wy then u(x,t) = £u(%, 4) 
obeys the Navier-Stokes equations with viscosity v if u(y, s) obeyed Navier-Stokes 
with viscosity v. If we fix the numerical value of v we can rescale solutions of 
the same equation by keeping L?T~! constant. The Reynolds number Re = 
where U is a typical velocity and L a typical length is a nondimensional number that 
helps organize solution classes. There is no deep mystery to this, but obviously, any 
correct statement should maintain its correctness after rescaling. Behavior under 
linear dilation is only one property of the equations. The nonlinear dynamical 
regularities are deeper and harder to exploit. 

Solutions of Navier-Stokes equations at high Reynolds numbers are studied 
numerically and are believed to describe experimental situations. The phenomeno- 
logically observed fact is that regions of high vorticity organize themselves in 
coherent intense vortex tubes, separated by small distances proportional to the 
Kolmogorov scale [22]. The phenomenological observations (by which we mean 
both numerical and experimental) can be summarized as follows. During the time 
evolution of turbulent three dimensional flow, strong coherent vortices form, and as 
they stretch they become more intense narrow vortex tubes. These are distributed in 
space in a disorganized manner. When two such intense vortex tubes approach each 
other as to nearly collide, threatening to create thus a dynamic singularity in the 
direction field of vorticity, they reconnect, performing thus a topological change, 
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prohibited in smooth Euler solutions, but permitted in the viscous evolution. The 
vorticity magnitude then (and there) locally decays. Several such events may occur. 
Topological change (vortex reconnection) is a nonlinear regularizing mechanism 
that might prevent singularity formation but it is difficult to formalize and to use in 
rigorous mathematical arguments. It is known that in 3D Navier-Stokes equations, 
if the direction of the vorticity 


is locally well behaved (locally Lipschitz continuous in regions of high vorticity), 
then no singularities can arise from smooth initial data. The regularization is due to a 
geometric depletion of nonlinearity: if the direction of vorticity is coherent, then the 
stretching term is no longer of the order of w7 but rather of the order £~'uw, where 
lis the coherence length. We use vaguely the term “coherence” but in this context 
it simply means “|V&| locally bounded”, and the coherence length is £ ~ |V&|7!. 
The theorem of [14] was generalized both in what concerns integrability conditions 
and geometric context [5, 23, 24]. The persistence for all time of a coherence length 
of the vorticity however is not known. What is known is an average bound for weak 
solutions: 


T 
vf / |w||VE|?dxdt < Ep 
0 JR 


where Eo is proportional to the initial energy of the solution tes |uo|?dx [9]. This 
bound says that, in regions of high vorticity, the direction of vorticity is coherent 
on average. If the direction of vorticity is coherent, then no singularities can form. 
There is a gap however between the sufficient condition on V€ ensuring regularity 
and the known average bound above. 

The fact that infinite vorticity is needed in order to produce a blow up in the 
Navier-Stokes equation is qualitatively similar to the situation in Euler equations, 
where singularities cannot form from smooth initial data on a time interval [0, T] 
if [2] 


T 
/ |@|| Leo dt < co. 
0 


The amplification of vorticity by the inviscid vortex stretching mechanism can 
conceivably be a route to regularity in Navier-Stokes equations. 
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3 Velocity 


Singularity formation in unforced Navier-Stokes equations in R? requires infinite 
velocity. There is no such result for the Euler equations, without additional 
assumptions about the nature of the blow up. In fact, this represents a marked 
difference between the two equations. For Navier-Stokes equations, if 


T 
. 
i ||Ul| poo qgsAt < 00 


then no singularities can arise from smooth and localized data in the time interval 
[0, T]. Or, if 


sup ||ul|z3¢@3) < 0° 
te [0,T] 


then no singularities can arise from smooth and localized data in the time interval 
[0,7]. The sufficient condition involving ||u||;0o(@3) is easy to prove. In fact, 
one integration by parts in the stretching term of the vorticity equation and 
straightforward estimates show that this condition results in an a priori bound for 
w € L©(0,T; L?(R*)) N L?(0, T; W!?(R)), and from then on we are in the case of 
strong solutions. 

By contrast, the sufficient condition involving ||u||73(3) is hard to prove [19]. 

As is the case with the conditions involving vorticity, there are gaps between 
these sufficient conditions and generally known results on the corresponding 
quantities. For the L° norm it is known that 


i 
/ ull pao py 60 
0 


and for the L? norm it is known that 


. 4 
i ells gaya. 


The first result was known for many years [21] (see also [10]). The second follows 
by interpolation directly from the known Leray bounds and Morrey’s inequality. The 
celebrated result of [6] bounds the Hausdorff dimension of the singular set of suit- 
able weak solutions of the Navier-Stokes equations. These solutions are not known 
to be unique. They are obtained as limits of good evolutionary approximations, they 
exist for arbitrary long time and they have numerous interesting properties. The 
singular set is the set in space-time where such a solution has infinite velocity. 
Its dimension is at most 1. The dimension of the singular set at the first blow up 
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time ought to be smaller. (The first blow up time is the putative time T where 
ie elle o gays < oo fort < T but ic lll Zo qqayds = oo.) The singular set is 
then the set of points x such that lim sup,_,....7,s<r |UQ, 5)| = 00. 


4 Pressure 


Singularity formation in unforced Navier-Stokes equations in R? requires infinite 
pressure. Specifically, if 


T 
/ plist < 00 


then no singularity can arise, from smooth and localized initial data [4] in the time 
interval [0, T]. Also if the pressure is bounded below 


inf p>-—oo 
0<1t<T,xER} 


then no singularities can occur in weak solutions [29]. Of course, the pressure 
is actually defined only up to a constant, and these criteria require a convention 
determining it. The pressure obeys a Poisson equation 


—Ap=V-(u- Vu) 


and in the whole space we may assume that the pressure decays at infinity. Then a 
natural choice for (—A)~! is convolution with the Green’s function for the whole 
space, and that gives an explicit formula for the pressure in terms of the velocity. 
Using the result that velocity bounded in u € L®(0, T; L3(IR*)) implies regularity, 
it is easy to recover the result that the pressure bounded in p € L7(0,T; L3(R)) 
implies regularity. Indeed, this follows from the inequality 


/ pu-V\uldx 
R3 


by standard manipulations (Hélder inequality, Morrey inequality). Because the 
pressure controls the magnitude of velocity and because a Navier-Stokes blow up 
requires the velocity to become infinite, the study of pressure is important for the 
regularity problem. In fact, in the absence of pressure, or if the pressure would have 
been a local function of velocity magnitude, then no singularities could arise in the 


Navier-Stokes equations. The regularity criterion involving fi || «|| va att would 


suggest that a condition like i IP lz-o¢a3) < CO would be a sufficient condition for 
regularity. 


d 
— |u|dx + vf |Vul*|uldx < 
3dt JR3 R3 
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There exists an explicit, quasi-local decomposition of the pressure [11] using the 
spherical averages 


1 
P= sa] povasoy, 


at arbitrary r > 0. The pressure is expressed as 


P(x) = B-(x) + 1-(x) 


where £, is a local average of the pressure, 


1 ar 
pio) == [ p,ap. 


The expression for z,(x) is computed from integrals in the ball of radius 2r of 
velocity increments 


b-uj(x) = uj(x + z) — uj(x) 


squared: 


[ZI 
nts) = fw (2) si@.nen Baylor 
where kj is symmetric in i,j, homogeneous of order —3 in the ball of radius one 
around the origin, smooth away from the origin, and it is explicit. The weight w is 


supported in [0, 2], equals one for 0 < A < 1, and 2—A on [1, 2]. The function zr, 
vanishes for the harmonic part of p, and is quadratically small in r: 


Ilr Ilzace3) S Cr” || Vullroacqsy 


for 1 < q < oo. On the other hand, the function 6, obeys very strong bounds for 
positive r: 


3) 2 
|B; Ilzcogesy < Cr *lluller@sy> 
-4 
IV Brllzcocesy < Cr“ [lull easy: 


These bounds follow from the equation for the pressure, using a kind of monotonic- 
ity equation for a modified object 


bx) =3 : — * 4s 
ea ae = 00) 
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Bounds of the form 


Brllascasy S Cgllellc2aqgsy 
for | < q < o are true for r = 0 as well, and follow from the Calderon-Zygmund 
bounds on p. The bounds for 6, are obviously sufficient for regularity. The bounds 


on z, are not. The smallness of z, is remarkable though. For instance, it follows 
from the well-known bound [11, 21] 


g 2 
i || Aull (>gay at <0o 


that 
Il7-Ilz3¢@3) S cir 
holds almost everywhere in [0,7] with ne C (t)3dt depending only on the initial 
energy || woll?, (py E and v. A stronger form of smallness of 7 is all that is needed 
for regularity. Indeed, if there would exist r > 0 (independent of time, although this 
requirement could be relaxed) and a constant C such that 
u-Va,+v|Vul? +C>0 

pointwise, then we would deduce regularity. This condition is a form of the 
requirement of the existence of positive r such that the local Reynolds number 


at scale r is small. If this condition is assumed, then regularity follows from the 
decomposition of the pressure at scale r, p = 6, + 2, because 


[i u- Welle 
is a priori finite (for fixed positive r) and because the inequality 
(0, + u- V—vA) |ul? < F(t) 
with i F(t)dt < oo results in 
lw IPoogesy < FO 


by the maximum principle applied to |u(x, #)|? — £ F(s)ds. 
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5 Zero Viscosity 


Many of the interesting mathematical problems about Navier-Stokes and Euler 
equations originate in the attempt to understand questions related to turbulence. One 
of the basic tenets of turbulence is the non-vanishing of energy dissipation € in the 
limit of high Reynolds numbers. The average energy dissipation of energy per unit 
mass is an important parameter in turbulence theory, where the Kolmogorov energy 
spectrum is given by 


E(k) = Cretk-3 


in the inertial range. The power law is deduced on dimensional grounds. If the 
answer depends only on the wave number k (which has dimensions of L~!) and 
€ (which has dimensions [e] = L*7~+), and if the answer is a product of powers, 
E = €°k’, then because the energy spectrum which has units of energy per wave 
number, i.e. [E] = L>T~? , equating powers of L and T we deduce the Kolmogorov 
spectrum. The trouble with this argument is not that it is simplistic, but rather that 
it gives the “right” result, for an astounding range of turbulent experiments and 
numerical simulations. The constant Cx is not dependent on Reynolds number. 
Because ¢€ = v(|Vu|*), no matter what interpretation we give to the average, 
somehow the average gradient of velocity should saturate the nonzero bound, in 
the limit of zero viscosity. This is termed by physicists “anomalous dissipation”. 

There are two distinct approaches to the question of anomalous dissipation. 
In the first, the limit of zero viscosity is taken on solutions of the initial value 
problem with fixed initial data. Under appropriate conditions this leads to a solution 
of the corresponding initial value problem of the Euler equation. This equation 
conserves energy if solutions are smooth, but might dissipate energy if solutions are 
not sufficiently smooth. This circle of ideas, and specifically the precise degree of 
smoothness needed, goes by the name of “Onsager conjecture” [20]. This approach 
is therefore about the initial value problem for the limit equations and it requires 
lack of smoothness of solutions. The blow up problem is open for 3D incompressible 
Euler equations, and this allows to envision the possibility of existence of dissipative 
solutions arising from smooth initial data. Anomalous dissipation of energy can 
be proven for incompressible 2D Euler equations as well, for very non-smooth 
solutions, although in 2D non-smooth solutions cannot arise spontaneously from 
smooth ones. The Onsager conjecture states roughly that energy is conserved on 
solutions of Euler equations if they are smoother than C 3, and that there are 
solutions of the Euler equations with exactly C 3 smoothness which dissipate energy. 
The first part of the statement is pretty much proved [7, 13]. Much progress has been 
made on the second part of the statement [3, 18]. 

The second way of looking at the anomalous dissipation issue is to take long time 
averages first, in order to achieve a “permanent regime” of the viscous equations, 
and only then send the viscosity to zero. This procedure has the advantage of being 
more appropriate to a turbulence setting which requires a statistical description. 
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Turbulence is generated at boundaries or through other forcing. Anomalous dis- 
sipation is much harder to establish in this case. The difficulty is conceptual, 
because it is necessary to produce long lived solutions that achieve an equilibrium 
that is not obtained by the balance of forcing and viscosity, rather by the balance 
of nonlinearity and forcing. In two dimensions if nonvanishing linear damping is 
imposed then a dissipation anomaly cannot exist [16, 17]. 

The finite time relationship between the Navier-Stokes equations and Euler 
equations is understood only for as long as the solutions of the Euler equations are 
smooth, and only if no boundaries are present (i.e. R¢ or T?, d = 2, ord = 3 under 
the assumption of smoothness.) In these cases, if the Euler solution starting from a 
smooth initial datum remains smooth on a time interval [0,7], then there exists a 
viscosity V9, depending on the Euler solution and on T, such that for all v < vo, the 
solution of the Navier-Stokes equation with the same initial datum exists on [0, 7], 
is smooth, and v-close in strong norms to the Euler solution [8, 25, 28]. 

In the case of boundaries, the mathematical problem is wide open, despite more 
than a century of effort. The difficulty stems from the presence of boundary layers, 
vanishingly small regions near the boundary where high gradients of the solutions 
are concentrated. In many studies, a reference smooth Euler flow is supposed to 
be known and used to set units of time and length. In other words, it is assumed 
that a smooth Euler solution is given and has velocity of order one and derivatives 
of order one. The classical Prandtl boundary layer length scale associated to the 
Eulerian solution is of order ./v and an asymptotic description based on this given 
Euler flow is attempted. An asymptotic description of the Navier-Stokes equation, 
based on a given “nearby” smooth Euler flow is difficult because the connection 
between the imposed Euler flow and the Navier-Stokes equation is illusory near the 
boundary. 

It was shown by Kato [26] that if the rate of dissipation vanishes on a much 
smaller scale, 


T 
lim / : v| Vu |?dxdt = 0 
¥>0 Jo Jdist(xdQ) <cv 


for Navier-Stokes solutions with viscosity v, then inviscid limits solve the Euler 
equation. In fact, in this case any weak limit of Navier-Stokes solutions is a weak 
dissipative up to boundary [1] solution of the Euler equations. There are variants of 
this sufficient condition ensuring convergence to an Eulerian solution which allow 
for more singular behavior, if there are no turning points in the flow [15]. The 
possibility of the inviscid limit to be a solution of the Euler equations exists, but 
is very limited. In general, the behavior of the inviscid limit might fail to be purely 
Eulerian. How does the laminar Eulerian picture break down (if indeed, as I suspect, 
it does) is still a mystery. Much remains to be done. 
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Plateau’s Problem 


Jenny Harrison and Harrison Pugh 


Abstract Plateau’s problem is not a single conjecture or theorem, but rather 
an abstract framework, encompassing a number of different problems in several 
related areas of mathematics. In its most general form, Plateau’s problem is to 
find an element of a given collection @ of “surfaces” specified by some boundary 
constraint, which minimizes, or is a critical point of, a given “area” function 
F : @ — R. In addition, one should also show that any such element satisfies 
some sort of regularity, that it be a sufficiently smooth manifold away from a well- 
behaved singular set. The choices apparent in making this question precise lead to 
a great many different versions of the problem. Plateau’s problem has generated 
a large number of papers, inspired new fields of mathematics, and given rise to 
techniques which have proved useful in applications further afield. In this review 
we discuss a few highlights from the past hundred years, with special attention to 
papers of Federer, Fleming, Reifenberg and Almgren from the 1960s, and works by 
several groups, including ourselves, who have made significant progress on different 
aspects of the problem in recent years. A number of open problems are presented. 


1 Introduction 


Plateau’s problem has intrigued mathematicians and scientists alike for over 200 
years. It remains one of the most accessible problems in mathematics, yet retains 
a subtle difficulty in its formulation. Many different versions of Plateau’s problem 
have been solved, but even today there are still important questions left unanswered, 
and deep mysteries about the problem still remain. 

Plateau’s problem was first posed by Lagrange, who in 1760 derived the minimal 
surface equation and asked if one could find a surface of minimal area with 
a prescribed boundary. The problem was later named after Plateau [74] who 
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Fig. 1 Three surfaces meeting at a triple junction singularity 


| 


Fig. 2. Non-standard boundaries. (a) A wireframe as a boundary. (b) A non-closed boundary 


undertook a physical study of soap films and characterized their properties, most 
notably their singularities. It was actually Lebesgue who coined the term “Plateau’s 
problem,” the crux of which was to describe these soap films in mathematical terms. 
These objects do not behave like classical surfaces such as embedded, or even 
immersed manifolds with branch points. Three sheets can come together along a 
line and form what is known as a triple junctions (Fig. 1). Soap films can span wires 
that are not cycles (Fig. 2). Some films are local area minimizers, yet can retract 
onto their boundaries (Fig. 3). 

The long journey towards a full understanding of these phenomena began with 
a much simpler question about the existence of a function with prescribed values 
on the boundary of a domain 2 C R?, such that the graph of the function on the 
interior of §2 is a minimal surface. This problem was studied by Weierstass and 
Riemann and evolved into the classical theory of minimal surfaces. The next step 
up in generality came with the study of surfaces defined as images of disks. Jesse 
Douglas won the first Fields Medal for his solution, which proved the existence of 
a minimally immersed disk in R* with a prescribed contour boundary. Many others 
continued on with striking results of existence and regularity along the way, and the 
Douglas-Plateau problem for surfaces with higher (non-infinite) genus in arbitrary 
dimension and codimension was finally solved by Jost in 1985. 
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Fig. 3 The Adams surface (a) retracts onto its boundary (b). The left portion of (a) is a triple 
Mobius band (a “Y” cross an interval, glued along the ends with a 1/3 twist, see also Fig. 9b) and 
the right portion is a classical Mobius band. They are joined by a bridge, so that the boundary is a 
single Jordan curve 


However, Fleming demonstrated the existence of a contour boundary which 
bounds a minimal surface of infinite genus (Fig. 4). In 1960, Federer and Fleming 
introduced objects known as integral currents which could model these somewhat 
pathological surfaces. Their novel approach won them the Steele prize and helped 
launch the modern study of geometric measure theory. They proved the existence 
of an integral current with a given boundary which minimizes mass, a quantity 
which can be thought of as area weighted by an integer multiplicity. It later became 
known that in low enough dimension, their minimizing current corresponded to an 
embedded minimal submanifold. 

At the same time, Reifenberg thought of a completely different approach. 
Building on results of Besicovitch and aided by Adams, he defined what it meant 
for a surface to span a bounding set A using Cech homology. Using his theory, he 
proved the existence of a surface with minimal area amongst those surfaces which 
can be written as a nested union of manifolds whose boundaries converge to the 
contour. These surfaces include non-orientable surfaces, as well as the example of 
Fleming, amongst others. His work is considered to be a masterpiece and deeply 
influenced several mathematicians, including Morrey, Almgren, Fomenko, as well 
as ourselves. 

Almgren proposed three approaches to Plateau’s problem. The first used varifolds 
[4], which Young [93, 94] had discovered but called “generalized surfaces.” Integral 
varifolds have a compactness theorem [2] which can be used to prove the existence 
of a stationary varifold with smallest area. Integral varifolds model just about any 
imaginable minimal surface, including the example of Adams (Fig. 3). However, 
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Fig. 4 Fleming’s example of a minimal surface with infinite genus. (a) A minimal surface with 
infinite genus. (b) The boundary wire can be made smooth except at a single point 


Almgren did not prove that this smallest stationary varifold was the smallest among 
a class of surfaces which also included non-stationary varifolds. 

Almgren’s second approach [7] was an attempt to generalize Reifenberg’s results 
to elliptic integrands. To read his paper requires expertise in methods of geometric 
measure theory, varifolds, integral currents, and flat chains, for it blends them all. It 
has some gaps, one of which seems serious (see [57] for more details.) 

Almgren’s third and final approach [9] was to define a new class of surfaces 
which later became known as quasiminimal sets, which, roughly speaking, have 
a controllable increase of area under small deformations. Although he was unable 
to prove an existence theorem of an area minimizer in this category, his regularity 
results are of major importance. 

The authors have recently announced the first solution to the full elliptic Plateau 
problem [57]. Our proof of existence of minimizers builds upon classical measure 
theory, and techniques of Reifenberg [77], Federer and Fleming [45]. Our proof 
of regularity relies upon Almgren [9], although Reifenberg [78, 79] is closely 
related. Spanning sets can be defined using homology, cohomology or homotopy. 
An axiomatic approach without requiring any definition of a spanning set is also 
provided. Our results carry over to ambient spaces of Lipschitz neighborhood 
retracts, including manifolds with boundaries and manifolds with singularities. 

In the last few years there has been a flurry of other activity in Plateau’s problem 
and in related fields, some of which we shall discuss below. For example, papers 
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Fig. 5 A piece of the helicoid, which extends to infinity in all directions 


have recently appeared on sliding boundaries, where the soap film’s interface within 
a larger boundary is permitted to move freely, on flexible boundaries, where the 
boundary itself is permitted to move subject to forces created by the spanning soap 
film, and on axiomatic theory, ellipticity, spanning conditions, and classical minimal 
surface theory. Indeed, very recently, a beautiful and central theorem in classical 
minimal surfaces was proved by Meeks and Rosenberg, namely that every simply 
connected, properly embedded minimal surface in R? must be either a plane or a 
helicoid (Fig. 5.) 

There are still many major questions in the world of Plateau’s problem, indeed 
the list seems to be growing, not shrinking. We have enumerated several of our 
favorites at the end of this paper, some of which are newly posed. 


Disclaimer 


Before diving in, the reader should be aware that the following exposition is far 
from complete and will often be imprecise. We hope that it will be useful to give 
non-specialists some idea of the history of Plateau’s problem, a few lines of current 
theoretical development, and some open problems, both enduring and emerging. 
We have endeavored to give a broad overview of many different aspects of the 
problem, and in doing so, have, by necessity, left out many important contributions 
by numerous mathematicians. If we have neglected to mention your favorite result, 
it was not due to malice, but rather due to the constraints of writing this article. If 
we have incorrectly stated your favorite result, know that we are experts in only a 
small portion of the Plateau problem, and would welcome any corrections you might 
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provide. Lastly, the authors would like to thank Emanuele Paolini,! Ken Brakke,” 
and Claire-Audrey Bayan? for the use of their soap-film figures. 


2 Classical Minimal Surfaces 


2.1 The Minimal Surface Equation 


Let ¥ be a surface in R*. We say that ¥ is a minimal surface if every point in has 
an €-neighborhood U which has least area among all surfaces § C R* with boundary 
dU. This condition is equivalent to 7 having vanishing mean curvature, and to the 
condition that 4 Area(Z,) |,=o= 0 for all compactly supported variations ¥, of X’. 
These are the two (and higher) dimensional analogs of geodesics, but one must be 
careful in this comparison: even in R”, a minimal surface with a given boundary 
may not have smallest area amongst all surfaces with that boundary. For example, 
consider two horizontal disks in R? separated vertically by a small amount. Their 
union forms a minimal surface, but the cylinder has smaller area, and the catenoid, 
smaller still (Fig. 6). 

A special kind of minimal surface & is one which occurs as a graph: 

Suppose 92 is a bounded open set in R? with locally Lipschitz boundary 0, and 
suppose g : 082 — R is continuous. Let F denote the set of continuous extensions 
of g to §2 which are continuously differentiable in {2 and whose derivative is 
integrable. Let A(f) denote the surface area of the graph of such a function f and 
suppose f € F solves Plateau’s problem for this setup. That is, f satisfies 


Fig. 6 Minimal surfaces with the same boundary. (a) Two disks. (b) Catenoid 


'Figures 2b, 3-7, 9. 
Figure 11. 
3Figure 8. 
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A(f) <A(h)  forallhe F. (1) 


; Vif) 


This differential equation is called the minimal surface equation. It is the Euler- 
Lagrange equation for the area functional. 


Then, 


Theorem 1. A function f € F satisfies (1) if and only if f satisfies the minimal 
surface equation (2). 


Such a function f is unique, and is in fact analytic on {2 [60]. 

Suppose now the domain 2 is a disk D of radius r. This produces a unique min- 
imal surface for each continuous function g on S', and thus there are uncountably 
many minimal surfaces that are graphs over disks of radius r < oo. However, if 
r = ov, the situation simplifies dramatically: 


Theorem 2 (Bernstein [13]). Any solution of the minimal surface equation (2) 
which is defined on all R? must be linear. 


Bernstein conjectured that this was also the case in higher dimension. Indeed this is 
the case up to dimension seven: 


Theorem 3 (de Giorgi [27], Almgren [6]). The Bernstein conjecture holds for 
minimal graphs T = {(x,f(x)) € R"t! : x € R'} forn < 4. 


Theorem 4 (Simons [81]). The Bernstein Conjecture holds for minimal graphs 
when n < 7. 


2.2 Recent Developments 


In the classical theory, non-compact minimal surfaces might not be graphs and 
might not have a boundary. An example in R? is the helicoid (Fig.5). A long 
outstanding question posed by Osserman was the following generalization of 
Bernstein’s Conjecture: The plane and the helicoid are the only properly embedded, 
simply-connected, minimal surfaces in R?. 

Osserman’s conjecture has recently been solved after many years of effort by 
Meeks and Rosenberg [70] who built on the work of Colding and Minicozzi [21, 22] 
as well as a number of other mathematicians (see [68] for a detailed discussion and 
a more complete list of citations.) 

Combining this with work of Collin [24], Lépez and Ros [63], Meeks et al. [69] 
give the following classification theorem: 
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Theorem 5. Up to scaling and rigid motion, any connected, properly embedded, 
minimal surface in IR3 is a plane, a helicoid, a catenoid or one of the Riemann 
minimal examples. In particular, for every such surface there exists a foliation of R* 
by parallel planes, each of which intersects the surface transversely in a connected 
curve which is a circle or a line. 


This beautiful theorem is only just the beginning of what looks to be a new era 
in classical minimal surface theory. A few particularly intriguing open problems are 
listed in Sect. 7 and can be found as part of a larger list in [68]. 


3 The Douglas-Plateau Problem: Immersions of Disks 
and Surfaces of Higher Genus 


The Plateau problem, in its original formulation, was to find a minimal immersed 
disk whose boundary was a given Jordan curve in R”. For any immersed disk, 
coordinates on the disk D can be chosen so that the immersion is conformal, in 
which case minimality is equivalent to the immersion being harmonic. 

Independently, Douglas [39] and Rado [75] proved the existence of such a 
surface, with (possibly) isolated singularities: 


Theorem 6. [fC is a Jordan curve in R", there exists a continuous map: D —> R" 
which is conformal and harmonic away from a set of isolated singularities (i.e. 
branch points), such that t| ap parameterizes C. 


The solutions produced by Douglas and Rad6 also had minimal area in the class 
of branched immersions. However, Douglas could prove slightly more than Rad6, 
principally his theorem allowed for certain pathological boundaries C which could 
only be spanned by disks of infinite area.* In addition, Douglas’s methods signaled 
a significant and promising departure from the classical techniques, and for these 
reasons he was awarded the first Fields medal in 1936. 

Osserman proved [72] that if n = 3, then branch points did not exist in such 
minimal disks. Thus, 


Theorem 7. Jf C is a smooth Jordan curve in R?, there exists a (conformal, 
harmonic) immersion t : D — R? such that t|ap parameterizes C, whose area 
is minimal among all immersed disks whose boundaries parameterize C. 


“Douglas was well known for his displeasure at having to share credit with Rad6 for Theorem 6. 
When teaching subsequent geometry courses, he eschewed those books which, in covering the 
theorem, contained the attribution “Douglas-Rad6.” Unfortunately, as the years went on, he was 
forced to use increasingly antiquated texts, since virtually no book published after the 1930s failed 
to give this (correct) attribution (See [85]. The second author has also heard a similar story from 
Martin Bendersky, who was a student in one of Douglas’s courses at City College.) 
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a 2 5 b > 
Fig. 7 A non-orientable surface with smaller area than any immersed disk with the same boundary. 
(a) An immersed disk spanning the boundary of the Mobius band. (b) A Mobius band 


EH 


Fig. 8 The transverse intersection of two sheets can be replaced with a pair of triple junctions with 
smaller area. These singularities do not, however, show up in the mass minimization problem (see 
Sect. 4.2), as the horizontal portion of the right hand figure would require higher mass to cancel 
out any contribution of the triple junction to the boundary 


However, there still could be immersed surfaces with boundary C whose area 
is strictly less than those of the solutions produced above. Consider a thin Mobius 
band. Its area is less than that of any immersed disk spanning the boundary curve 
(Fig. 7). Higher genus surfaces could also have less area. Douglas had made attempts 
to generalize his techniques to account for possibly non-orientable surfaces and 
those of higher genus, but the consensus seems to be that his arguments were 
incomplete [52]. It took until the 1980s for a complete solution to the higher genus 
problem to appear in a paper by Jost [61] (see [14] for the non-orientable case), 
who built upon ideas of Schoen-Yau [84] and Sacks-Uhlenbeck [88]. Tomi-Tromba 
[87] soon after offered a different solution to the higher genus problem based on a 
development of Teichmiiller theory from the viewpoint of differential geometry. 

Although not a mathematical shortcoming, self-intersections in which two sheets 
intersect transversally can easily show up in Douglas’s and others’ immersed 
solutions. Such solutions are not physically realistic as soap-films, since transverse 
intersections resolve into pairs of triple junctions (Fig. 8) with smaller total area 
when one is allowed to consider surfaces more general than immersed manifolds. 
These generalized surfaces with triple junctions became important in the 1960s in 
the work of Reifenberg [77], Almgren [7] and Taylor [86], who studied soap-film 
regularity and classified the singularities for these general size-minimizing surfaces. 
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4 Orientable Generalized Surfaces 


Fleming’s example (Fig. 4) shows that one must consider all topological types to 
find a true minimizer for an orientable version of Plateau’s problem. The curve in 
the figure is an unknotted simple closed curve and is smoothly embedded except 
at one point. It bounds the shaded surface which is orientable and clearly area 
minimizing. This shows that one should not insist that competitors have finite 
topological type when looking for absolute area minimizers. The proof of the 
existence of an orientable surface minimizing the area among all possible surfaces 
without restriction on their topological type requires other techniques, namely the 
results of Reifenberg [77] and the integral currents of Federer and Fleming [45]. 


4.1 Hausdorff Measure 


Readers will recall that for any E C R", and any non-negative real number k, the 
k-dimensional Hausdorff measure of E is 


HO*(E) = lim EXE) 
where 


HGS (E) = ay inf } Y~ diam(Uj)* : E C UjerUig 


ie] 


a, is a normalizing constant, and the infimum is taken over all coverings of E by a 
collection {U;}ic7 of sets with diam(U;) < 6. The normalizing constant a; is chosen 
so that when k is an integer, the k-dimensional Hausdorff measure of the unit cube 
in R* is one. 

Note that a (E) is monotone decreasing in 6, so the limit limso Tae (E) 
exists but may be infinite. .#* is a Borel regular outer measure and coincides with 
Lebesgue measure when E is a k-dimensional submanifold. 

The Hausdorff dimension of E is the infimum over all k > O such that 
KH*(E) = 0. 


4.2 Integral Currents and Mass Minimization 


A k-dimensional current T on an n-dimensional smooth manifold M is a linear 
functional on the space of compactly supported smooth k-forms Y*(M), continuous 
in the following sense: 
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If w; is a sequence of k-forms supported in a single compact set K contained in 
a coordinate neighborhood and 0’@; — O uniformly for all 0 < |r| < oo, then 
T(a;) > 0. Here r = (rj,..., 7) is an-tuple of non-negative integers, |r| = > 7, 
and 0” is shorthand for the conidanate: -wise differentiation operator 


girl 


Ox; 2. Ox 


It is important to note that the topology on Y*(M) is strictly finer than the 
subspace topology induced by the inclusion of Y*(M) into the space &*(M) of 
C™ differential k-forms on M, in which a sequence of forms n; converges to zero 
whenever 0"7;|«.—> 0 uniformly for all 0 < r < i and all compact sets K contained 
in a coordinate neighborhood. The difference is subtle, but has extremely important 
consequences (the full description of the space J”, its topology, and continuous dual 
was the major component of L. Schwartz’s Fields medal.) For example, in £°(R), 
any sequence of bump functions f; equal to 1 on the interval [—i, i] converges to 
the function 1, which is no longer compactly supported. Such a sequence is not 
convergent in Y°(R). As a matter of fact, Z*(R”) is complete, so it is not even 
Cauchy. 


4.2.1 Examples 


¢ If S C M is an oriented k-dimensional submanifold, then a k-current [[S]] is 
defined, setting [[S]](w) = flo 

¢ If M is equipped with a volume form dV, then a k-vector field X on M defines a 
current [[X]], whereby [[X]](w) = J), o(X)dV. 

¢ A generalized Dirac delta is a current: if p ¢ M anda € A*(T,M), then a current 
[[(p, &)]] is defined, where [[(p, a) ]](@) = @,(a). 

* An (n—k)-form 7 in &”~*(M) defines a k-current [[7]], where [[7]](@) = Jy Nhe. 
Such a current is called a smooth current. Through convolution, it is possible to 
construct a smoothing operator which approximates any current by a smooth 
current (See [76, Sect. 15]). 


Denote the space of k-dimensional currents by %(M). The operator dual to 
exterior differentiation on forms, denoted 0, turns Y(M) into a chain complex. 
The image and kernel of 0 are closed. When given the opposite grading (i.e. give 
GY,(M) degree n — k), the resulting cochain complex (F,—e, 0) is quasi-isomorphic, 
via application of the aforementioned smoothing operator, to the cochain complex 
(€°(M), d). Thus, Poincaré duality holds in this setting: the homology of currents in 
degree k (which is dual to the compactly supported de Rham cohomology in degree 
k) is isomorphic to de Rham cohomology in degree n — k. 

Before we can describe a sub-complex of (%, 0) which computes the integral 
homology of M, it will be necessary to define the mass of a current. If M is equipped 
with a Riemannian metric and W C M, define ||T||(W) = sup{T(@) : supp(w) C 
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W, |l@llo < 1}, where ||@||o is the supremum of w,a, where p € M and o is a unit 
simple k-vectors in A‘ T,M. The (possibly infinite) mass of T, denoted M(7), is the 
quantity ||T||(R”). If ||T||(W) is finite for every W CC M, we say T has locally 
finite mass. 

A k-dimensional current T is called (integer) rectifiable if it has locally finite 
mass and there exists a sequence S; of C! oriented k-dimensional submanifolds of 
M, a sequence of pairwise disjoint closed subsets K; C S$; and a sequence of positive 
integers k; such that 


70) =k fo 


for all a € Z*(M). If T and OT are rectifiable, we say that T is an integral current. 
One can show, e.g., using sheaf theory, that the homology of the chain complex 
(.%.(M), 0) of integral currents computes the homology of M with Z coefficients. 

One can also show that the mass M(T) of an integral current T is the same as the 
quantity )>, k; 7"(Ki). 

Central to the utility of integral currents is the following compactness theorem: 
Theorem 8 (Federer-Fleming). [f{T7;} C -%(M) is a sequence of integral currents 
such that 


sup | Ti||(W) + |laTi[|(W) <0 


for all W CC M, then there exists an integral current T and a subsequence of {T;} 
which converges weakly to T. 


Since mass is weakly lower-semicontinuous, Federer-Fleming produced the 
following corollary: 


Corollary 1. [fT € .4,.(M), then there exists Ty) € %,(M) with T — Ty = ORo for 
some Ro € 44 such that 


M(7o) = inf M(7o + OR). 
(To) hos (To + OR) 
As a special case, setting Q = OT: 
Corollary 2. If Q € 4_,(R"), there exists Ty) € 4,(IR") with dT) = Q such that 
M(7o) = inf M(T). 
TE,,IT=O 


Another special case occurs when T is a cycle: 


Corollary 3. Each class in H,(.%(M), 0) contains a representative of least mass. 
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The support supp(T) of a current T € ZY is the complement of the largest open 
set U for which supp(w) C U => T(w) = 0. We say p € supp(T) \ supp(0T) is 
an interior regular point if there exists € > O, a positive integer x and an oriented 
k-dimensional smooth submanifold S such that T(w) = «[[S]](@) for all forms w 
supported in the ball of radius € about p. The remaining points in supp(T) \ supp(0T) 
are called interior singular points, the set of which will be denoted A(T). 


Theorem 9 (Complete Interior Regularity). [fT € -%,(IR"t'), where 2 <n < 6, 
and the mass of To is minimal among all integral currents with the same boundary, 
then supp(To) \ supp(0To) is an embedded minimal hypersurface in R" \ supp(0To), 
and YP (To) is empty. 


In 1962, Fleming proved the result for n = 2, so other than regularity at the 
boundary which was to take another 17 years [59], this result completed the solution 
of the oriented Plateau Problem in R? for surfaces of all topological types. 

Almgren [6] extended Fleming’s theorem to n = 3, and Simons extended it up 
ton = 6in [81]. 

Also in [81] Simons constructed an example which showed that singularities 
could in fact occur in dimension 7 and higher. The “Simons cone” is the cone over 
S? x $3 CS’ C R®. He showed it was locally mass minimizing, yet has an isolated 
interior singularity. Immediately after Simons published his example, Bombieri, de 
Giorgi, and Giusti [12] showed in a marathon three-day session? that S$ is in fact 
globally mass minimizing. As a corollary, they also showed that, for any n > 8, 
there exist functions which satisfy the minimal surface equation and are not affine, 
finally settling the Bernstein problem in all dimensions. 

Not long after, Federer [44] put a bound on the size of the singular set Y(T): 


Theorem 10. The singular set A(Ty) has Hausdorff dimension at most n — 7. 
Singularities are isolated points ifn = 7. 


Bombieri, de Giorgi, and Giusti [12] showed that this bound is sharp: there exist 
mass minimizers Tp in every dimension n > 7 such that #7"~’(PA(Tp)) > 0. In the 
90’s, Simon [83] proved this singular set is well-behaved: 


Theorem 11. Except for a set of #"~' measure zero, the singularity set of a 
codimension one mass minimizer Ty is covered by a countable collection of C! 
submanifolds of dimension m — 7. 


A new and simpler proof of Simon’s theorem has been recently found by Naber 
and Valtorta [71]. To the best of our knowledge, however, it is still an open question 
whether or not the remainder of the singularity set stratifies as lower-dimensional 
submanifolds. 

Surprisingly, codimension one mass minimizers do not have boundary singulari- 
ties, as Hardt and Simon [59] established: 


>This story was recently communicated by Simons to the second author. 


286 J. Harrison and H. Pugh 


Theorem 12. If T) € .%,(R"*'), dT = [[S]] for some oriented embedded C? 
submanifold S C R", and the mass of To is minimal among all integral currents 
with the same boundary, then there exists an open neighborhood V of S such that 
VM supp(To) is an embedded C'“ hypersurface with boundary for all0 < a <1. 


The story is more complicated and incomplete in higher codimension. Almgren 
in his 1700 page “big regularity paper” [10] proved the following theorem: 


Theorem 13. The singular set P(Ty) of an m-dimensional mass-minimizing inte- 
gral current Ty in R" has Hausdorff dimension at most m — 2. 


Again, this bound is sharp in codimension > 2 [42]. Chang [20] built upon 
Almgren’s work to show that if m = 2, then the singularity set consists of isolated 
branch points. More recently, in a series of papers [30-34], De Lellis and Spadaro 
took on the monumental task of modernizing and simplifying Almgren’s work. For 
an excellent overview of their approach, see [28]. 


5 Non-orientable Generalized Surfaces 


Much of the above story can be repeated using chains with coefficients in a finite 
group, and in particular in Z/2Z to account for non-orientable surfaces. Fleming 
[47] has a beautiful theory of flat chains with coefficients (see also [95]), the 
homology of which recovers the mod-p homology of the ambient space. 

However, soap films that occur in nature are not only non-orientable, but possess 
singularities such as triple junctions which are not amenable to mass-minimization. 
To ensure that the triple junction not be part of the algebraic boundary, one must 
assign one of the three surfaces a higher multiplicity. This in turn increases the total 
mass of the surface, and as a result triple junctions do not show up in solutions to 
the mass minimization problem. 

To get around this issue, there is a different approach one can take, and that 
is to ignore multiplicity when measuring area. Instead of minimizing mass, one 
can instead minimize size, which for an integral current >; k; /; x, 18 the quantity 
>); #*(K;). The k-dimensional size of an arbitrary subset E C M is just 7*(E). 
Note that the size of an integral current may be smaller than the size of its support. 
The primary difficulty with working with size is that unlike mass, it is not weakly 
lower semicontinuous. Extreme care must be taken with the minimizing sequence 
to account for this. The payoff is that size is better suited to the study of soap films 
than mass. 


5.1 Reifenberg’s 1960 Paper 


The same year that Federer and Fleming’s seminal paper appeared [45], Reifenberg 
published a work [77] which dealt with the Plateau problem for non-orientable 
manifolds of arbitrary genus. This paper is also famous for a result that later became 
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known as “Reifenberg’s disk theorem,” which placed sufficient conditions on the 
approximate tangencies of a surface to guarantee that it was a topological disk. A 
set satisfying these conditions is now known as Reifenberg flat. Reifenberg was 
well known for his prowess with tricky if not quirky estimates, and indeed his 
disk theorem did not disappoint: a condition involved in the statement required 
€ = 7-200’, Reifenberg’s paper has sparked a number of subsequent results: 
Almgren provided a generalization to so-called “elliptic integrands” in [7]. Morrey 
generalized Reifenberg’s result to ambient manifolds in [65] (see also [66]). 

Reifenberg’s approach to proving his main theorem was built on work by 
Besicovitch and was purely set-theoretic, not involving any fancy machinery such 
as currents or varifolds. His main result was the following: 

Consider a finite collection A of pairwise disjoint Jordan curves in R*. A compact 
subset X of IR? is said to be a surface spanning A if X can be written as an increasing 
union of manifolds X; with boundary, such that for each i, there exists a manifold Y; 
with boundary A U 0X; such that Y; > A in the Hausdorff distance. Reifenberg then 
proved: 


Theorem 14. There is a surface spanning A of least area. 


His class of surfaces included those with infinite genus such as in Fleming’s 
example (Fig. 4), and non-orientable surfaces as well. However, it did not include 
soap-film type surfaces with triple junctions (Fig.9b). Reifenberg proved a sec- 
ondary result which did minimize amongst this larger category, in the case that 
A C R" is homeomorphic to the (m — 1)-sphere: 


Fig. 9 Depending on the configuration of the boundary wire, any of (b), (c) or (d) can have smaller 
area. (a) The boundary of a triple Mébius band. (b) A Triple Mobius band. (c) A non-orientable 
embedded surface. (d) An immersed disk 
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Theorem 15. /f A is a topological (m — 1)-sphere in R", 2 < m < n and Y* is 
the collections of all compact sets X D> A which do not retract onto A, then there 
exists a set X € Y* with least m-dimensional Hausdorff spherical measure. Any 
such minimizer is locally Euclidian almost everywhere. 


Both of these theorems were special cases of a general result involving “surfaces 
with algebraic boundary,” which were defined by Reifenberg and developed by 
Adams in an appendix of [77]. 

Let G be a compact® abelian group and suppose A is a compact subset of R” 
with #2”—'(A) < oo. Suppose L is a subgroup of the (m — 1)-dimensional Cech 
homology An 1(A; G) of A with coefficients in G. We say that a compact set X D A 
is a surface with a aber boundary > L if L is in the kernel of the inclusion 
homomorphism ts : Hn—1(A; G) > An (X; G). Reifenberg proved existence of a 
surface with algebraic boundary D L with least m-dimensional Hausdorff spherical 
measure. The case that G = Z/2Z implies Reifenberg’s first theorem, and the case 
that G = S' implies, via a theorem of Hopf, the second. 

A shortcoming of Reifenberg’s theory is that for boundaries more general than 
a sphere, he did not defined a single, unifying collection surfaces with soap-film 
singularities. For example, consider the disjoint union of a disk and a circle in R?. 
There is no retraction to the pair of circles, yet we might not want to consider this 
as an admissible spanning set. As another example, consider the surfaces X;, i = 
1,2, 3, in Fig. 10. Any one could be a surface with minimal area, depending on the 
distance between the circles, but a simple computation shows there is no non-trivial 
collection of Reifenberg surfaces which contains all three simultaneously. Thus, 
one would have to find an appropriate subgroup L which would produce the correct 
minimizer, and this task would change depending on the configuration of the circles 
in the ambient space. 
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Fig. 10 Three minimal surfaces spanning three circles 


©The exactness axiom is used in the proof and Cech homology only satisfies exactness when the 
coefficients are compact, so we will need this assumption. 
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In a recent paper [55, 56], the authors found a way around this problem using 
linking number to define spanning sets, and later in [58], using Cech cohomology 
in higher codimension. We also generalized Reifenberg’s result so as to minimize a 
Lipschitz density functional. 


5.1.1 Spanning Sets via Linking Numbers 


Definition 1. Suppose A is an (n—2)-dimensional compact orientable submanifold 
of R”", n > 2. We say that a circle S embedded in R” \ A is a simple link of A if 
the absolute value of the linking number L(S,A;) of S with one of the connected 
components A; of A is equal to one, and L(S,A;) = 0 for the other connected 
components A; of A, j # i. We say that a compact subset X C R"” spans A if 
every simple link of A intersects X (see Fig. 11). 


S239® 
Ode 
O® 


Fig. 11 Each row of surfaces depicts a distinct type of minimal surface spanning the Borromean 
rings. A surface in the first row spans the Borromean rings using any linking test. That is, every 
simple link of any number of curves must meet the surface. In the second row, every simple link 
of one curve or all three curves must meet the surface. The third row has mixed types 
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If A is a topological (n — 2)-sphere then the set of spanning surfaces is the 
same as the collection Y* above. Any orientable (n — 1)-manifold with boundary 
A spans A. The set A can be a frame such as the (n — 2)-skeleton of an n-cube, in 
which case one can specify (nm — 2)-cycles which the simple links need to link. This 
procedure generalizes to higher dimension using linking spheres, or alternatively, 
via Alexander duality, to Cech cohomology. 

This idea was first proposed for connected smooth boundaries in the exis- 
tence paper of [54] which was followed by the more substantial [55, 56] which 
established lower semicontinuity of Hausdorff measure for a minimizing sequence 
X; — Xo in codimension one and applied to any number of boundary components. 
One of us (HP) realized that linking number tests could be naturally viewed a 
cohomological spanning condition in higher codimension,’ and while we were 
writing this generalization [58], two papers appeared [35] and [37] which built upon 
our linking number test for spanning sets. See [57] for a new homotopy spanning 
condition, building on [35] and [37]. 

The cohomological spanning condition is stated as follows: if L is a subset of the 
(m—1)-st (reduced) Cech cohomology group Hn"! (A; G) (G need not be compact), 
we say that X D A is a surface with (algebraic) coboundary D L if L is disjoint 
from the image of 1* : Ar! (X;G) > H”"—(A; G). 

One of the primary benefits of using this definition over the covariant “surface 
with algebraic boundary” is that if A is an oriented manifold, then there is a natural 
choice for the subset L, namely the collection L” of those cocycles on A which 
evaluate to 1 on the fundamental cycle of a particular component of A, and zero 
on the rest. By naturality of the Alexander duality isomorphism, the collection of 
surfaces with coboundary > L” is equivalent in codimension one to the collection 
of compact sets which span A in the sense of linking number. 

Eight years after [77] was published, Almgren proposed an extension [7] of 
Reifenberg’s theorem to prove the existence of surfaces which minimize not only 
area, but area weighted by a density function which is permitted to vary in both 
spacial and tangential directions, subject to an ellipticity condition. To discuss 
Almgren’s papers [7] and [9] we will need to introduce rectifiable sets and varifolds, 
which we now define. 


5.2 Rectifiable Sets 


A subset E of R” is m-rectifiable if there exist a countable collection of Lipschitz 
maps {f; : R” — R"} such that the m-dimensional Hausdorff measure of 
E \ Uofi(R”) is zero. If E is 4°” measurable and #"(E) < oo, then the maps 
fi can be taken to be C!. Such sets are the higher dimensional analog of rectifiable 
curves. The defining property of a rectifiable set is that it is equipped with a unique 


7Similar “surfaces with coboundary” were discovered independently by Fomenko [49]. 
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“approximate tangent m-plane” almost everywhere, a consequence of Rademacher’s 
theorem. If these approximate tangent spaces are equipped with an orientation, 
it becomes possible to integrate m-forms: An integer rectifiable current is just an 
integer weighted rectifiable E together with an orientation, i.e. an 4” measurable 
field of m-vectors on E such that for almost every p € E, the m-vector at p is unit 
simple in the direction of the approximate tangent space at p. 

A set F is purely m-unrectifiable if /"(F N E) = 0 for every m-rectifiable 
set E. Every subset of R” with finite #7” measure can be written, uniquely up to 
# measure zero sets, as the disjoint union of a m-rectifiable set an a purely m- 
unrectifiable set. The beautiful Besicovitch-Federer structure theorem says that if 
F is purely m-unrectifiable, then for almost every m-plane V in the Grassmannian 
Gr(m, n), the orthogonal projection of F onto V has .#”" measure zero. 

Morally, every subset of Euclidian space can be decomposed almost uniquely 
into a countable collection of C! submanifolds, and a remainder which casts no 
shadows. 


5.3 Integral and Stationary Varifolds 


Varifolds were first introduced by Young [93, 94] as “generalized surfaces” and 
developed by Young and Fleming [46, 51]. Fleming, who had been Young’s student, 
in turn, taught Almgren what he knew [48] when Almgren was a student at Brown 
1958-62. Almgren took an interest in generalized surfaces and changed the name to 
“varifolds,’ a mnemonic for manifolds in the calculus of variations. He produced 
a set of mimeographed notes [4] on varifolds that were circulated amongst his 
students but never published. Allard, who had been Fleming’s student, produced the 
definitive reference on varifolds [2] in which he proved the compactness theorem 
for integral varifolds. 


Definition 2. Let M be a smooth n-dimensional Riemannian manifold. A k-varifold 
V in M is a Radon measure on the total space of the Grassmannian bundle z : 
Gr — M, whose fiber above a point p € M is the Grassmannian of un-oriented 
linear k-planes in T,M. The pushforward of V by z is denoted ||V||. The mass of V 
is the quantity ||V||(47). The support of V is the support of the measure ||V]|. 


For example, an embedded k-dimensional submanifold S C M, together with a 
H¢*-measurable function 9 : S > R* determine a varifold V as follows: 


V(A) := / 6(p)d HK" (p). 
SN{p:(p,T,S) EA} 


More generally, S can be replaced by a k-rectifiable set. In this case V is called 
a rectifiable k-varifold. If 0 takes integer values, the varifold is called an integral 
varifold. Integral varifolds are the non-orientable analogs of integral currents. There 
is no notion of integration of differential forms on a varifold, and unlike currents 
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the space of varifolds does not possess a boundary operator. However, integral k- 
varifolds can be pushed forward by a Lipschitz map. 

Almgren saw these features as an advantage, for he wanted to model 
non-orientable surfaces and those with triple junctions. In [5] Almgren 
credits Federer-Fleming for proving the Plateau problem for mass minimization 
of oriented surfaces, and Reifenberg for size minimization of non-oriented surfaces 
“subject to certain topological restraints.’ Almgren sought at this time to prove a 
mass minimization result for non-oriented surfaces. He did not specify what it meant 
for a varifold to span a given contour in [4] or [5], but his focus at the time was on 
stationary varifolds where the definition seemed self-evident as we shall next see. 

The first variation 5V of a compactly supported varifold V is a function which 
assigns to a smooth compactly supported vector field Y on M the rate of change of 
the mass of the pushforward of V by the time-t map of the flow of Y att = 0.A 
varifold is stationary if 6V = 0. 

In [4], Almgren proved the following theorem: 


Theorem 16. Let M be a smooth compact n-dimensional Riemannian manifold. 
For each 0 < k <n there exists a stationary integral k-varifold in M. 


Allard in [2] proved a beautiful regularity result for such varifolds: 


Theorem 17. Jf V is a stationary integral k-varifold in a smooth compact n- 
dimensional Riemannian manifold M, 0 < k <_n, then there is an open dense 
subset of the support of V which is a smooth k-dimensional minimal submanifold 
of M. 


Even many years later, this theorem remains state-of-the-art in terms of what 
is known about the singularity set of stationary varifolds. For example, it is not 
known if the singularity set has zero Hausdorff measure in dimension k. Indeed, 
regularity theory for stationary varifolds is still at an early stage, even compared to 
what is known about mass minimizing integral currents in higher codimension. See 
[17] where Bombieri mentions this problem, as well as [29] for an accounting of 
progress. 


5.4 Elliptic variational problems 


In [7] Almgren initiated the study of elliptic variational problems for non-orientable 
surfaces by providing the first definition of an elliptic integrand and a proof of 
regularity, depending on the degree of smoothness of the integrand. His definitions 
and main regularity result follow: 

Let A be a compact (m — 1)-rectifiable subset of R” with #7"~'(A) < 1,G 
a finitely generated abelian group, and o0 € H,,(R", A: G). We say a compact 
m-rectifiable set X D A is a surface which spans o if o is in the kernel of the 
homomorphism on homology induced by the inclusion (R", A) @ (R", X). 

A C* (resp. real analytic) integrand is a C* (resp. real analytic) function f : 
R” x Gr(m, n) > [a, b], where 0 < a < b <1. We say f is elliptic with respect to G 
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if there exists a continuous function c : R" > RN {t: t > O} such that if D C R" 
is an m-disk, t € H,,(IR", 0D; G) \ {0}, and D is any surface which spans tT, then 


[ f(&, TyD) dH" (y) — i f(x, TyD) dH" (y) > cx) (H"(D) — 42" (D)) 


for all x € R”. 


Theorem 18. Let 3 < k <1andGéG. Iff is a C* (resp. real analytic) integrand, 
elliptic with respect to G, and S is a surface which spans o such that 


/ fx, T,S) d(x) < / fle, T.T) 4") 
Ss T 


for all surfaces T which span o, then S is #2" almost everywhere a C' (resp. real 
analytic) submanifold of R". 


The authors proved there exists such a surface S. These results marked a 
significant advance® over Reifenberg’s paper which only dealt with the functional 
f = 1, and bring Plateau’s problem, which had grown well beyond the classical 
theory of minimal surfaces and the minimal surface equation, squarely back into the 
realm of PDE’s. 


5.4.1 (f,€, 6)-minimal sets 


In his memoir [9] (see also [8]), Almgren defined new classes of surfaces to 
model soap bubbles as well as many types of soap films. His regularity theory for 
minimizers in such a class is often cited. 

Fix A C R". If @ : R" — R’ is Lipschitz, let Wy = {x : O(X) F 4}. If 
W,»U@(Wg) is disjoint from A and contained in a ball of radius 6 for some 0 < 6 <1, 
we say that ¢ is a §-deformation fixing A. 

Let 1 < y < oo. Acompact set X C R” with #"(X) < 0 is (y, 8)-restricted 
with respect to A if 


FOX Wg) < YFE"(O(X Wyo) 


for all 6-deformations ¢ fixing A. 


8Readers should be warned that the existence portion of [7] contains a serious gap. Briefly, a 
minimizing convergent sequence for a bounded elliptic integrand does not automatically yield a 
uniformly quasiminimal subsequence, but [7] assumes that it does. This is a critical part of the 
argument for existence of a minimizer (see [57] for a more detailed discussion.) In [9] spanning 
surfaces are chosen to be a priori uniformly quasiminimal so that the problem disappears. However, 
he was not able to prove a general existence theorem (see [67] for further details.). 
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If € : [(0,0c0) — [0, oo) with €(0) = 0 is a continuous non-decreasing function, 
the set X is called (f, €, 8)-minimal if in addition for every r-deformation ¢ fixing 
A,O<r<6é, 


/ Fx, T,X) d(x) < (1 + €(0)) , Fle, Tp (X)) d"(x). 
x o(X) 


An important fact about (y, 6)-restricted sets is that they are m-rectifiable [43, 
3.2.14(4)] and have both upper and lower bounds on density ratios. Almgren proved 
regularity results a.e for (f, €, 5) minimal sets in [9]: 


Theorem 19. Suppose f is elliptic and C? and X is (f ,€,5)-minimal with respect to 
A where € satisfies 


1 
i t 9% (1) /2dt < 00 
0 


for some 0 < a < 1. Then there exists an open set U C R" such that 7" (X\U) = 0 
and X QU is a C! m-dimensional submanifold of R". 


Almgren states however that “These hypotheses and conclusions, incidentally, do 
not imply that $M U locally can be represented as the graph of a function which 
satisfies any of the various Euler equations associated with f.” Indeed, any C? m- 
dimensional submanifold S with boundary is? (M,¢,65) minimal with respect to 0S 
if 5 is sufficiently small and ¢ is a linear map with large slope. 

For the three-dimensional case, Taylor [86] relied upon Theorem 19 to prove a 
beautiful soap film regularity result for (M/, ¢, 5)-minimal sets. However, Morgan 
[67] points out that the class of (/, 0, 5)-minimal sets, taken over all 6 > 0, is not 
compact. It remains an open problem of whether a smoothly embedded closed curve 
in R? bounds a film with minimal area in the class of all (M, 0, 5)-minimal sets. 

Another open problem motivated by [35, 37] and [55, 56] is to prove the same 
regularity theorems as above in the case that ¢ is also required to be uniformly close 
to a diffeomorphism. 


6 Variable Boundaries 


6.1 Sliding Boundaries 


The notion of a sliding boundary has had a long history in the study of elasticity 
in mechanical engineering (see Sect. 24 of [73], for example). David brought the 
attention of this problem to those in geometric measure theory [26]. We shall 
mention a formulation of the problem found in [35] and [37] which was influenced 


°Here and in the literature, M denotes the constant function 1. 
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by [26]. These works assume that the bounding set A has zero .#” measure and 
often (m — 1)-rectifiable, but others do not (see, e.g., [23, 50]) as applications often 
require a large, and even rough, bounding set. 


Definition 3. Let A C R” be compact and S, C R” \ A be relatively compact. 
Let Lip(A) denote the collection of Lipschitz maps ¢ : R” — R" such that there 
exists a continuous map © : [0, 1] x R” > R” with O(1,-) = ¢, O(0,-) = Id and 
@O(t,A) C A for each t € [0, 1]. Define 


G(A, Sx) = {S: S = $(Ss) for some @ € Lip(A)} 


and call S,. a sliding minimizer if 7” (S.) = inf{#7"(S):S € @(A, Sx)}. 


Note that @(A,S.) does not form an equivalence class. It is not known if 
@(A,S«) is compact. However it can be shown with some assumptions on A 
(see [35] for codimension one, [37] for higher codimension) that if {S,} C 
@(A, Sx) is a minimizing sequence, then the measures .#”|s, converge weakly to a 
measure g.#"|s, where So is m-rectifiable and g > 1. In particular, #’"(So) < 
lim inf #7" (S;). It is not known if So € @(A,S.), but [35] and [37] proved 
nonetheless that So is a sliding minimizer. We address a slightly different sliding 
boundary problem in [57]. 

It is an open question if this result extends to Lipschitz or elliptic integrands. It 
is similarly open to prove the result if the aforementioned assumptions on A (e.g., 
SE" (A) < 00) are removed. 


6.2. Euler-Plateau Problem 


Mahadevan and Giomi [53] proposed a type of Plateau problem in which a rigid 
boundary is replaced by a soft boundary such as a flexible wire. Specifically, in the 
language of Kirchhoff’s theory of rods [38], permissible boundaries are circular rods 
which resist bending yet are inextensible, unshearable, without intrinsic curvature, 
and without resistance to twisting about their centerlines. Mahadevan and Giomi 
formulated an energy functional which measured not only the area of a spanning 
surface, but also the energy of the boundary. The resulting Euler-Lagrange equations 
are equivalent, in the zero surface tension case, to those derived by Langer and 
Singer [64] (see [41]). This minimization problem is called the Euler-Plateau 
problem after Euler’s study of column buckling, but might more appropriately be 
called the Kirchhoff-Plateau problem.'° 

Chen and Fried [41] rigorously derived the equilibrium conditions for the 
minimization problem, and provided geometric and physical interpretations of these 
conditions. Briefly, the surface on the interior must have zero mean curvature, and 
the boundary is required to bend elastically in response to a force exerted by the 


'0The authors would like to thank Eliot Fried for helpful remarks. 
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spanning film. The class of competitors for minimization are those surfaces which 
occur as images of the disk. However, since the boundary is permitted to vary, 
the maps cannot, in contrast to Douglas, Radé and Courant, be assumed to be 
conformal. See also [16, 50]. 

These papers are closely related to earlier work by Bernatzki [15] and Bernatzki- 
Ye [19]. 


7 Open Problems 


Though we have labeled the following problems as “open,” some may have been 
solved without our knowledge. If you have solved one of these, please accept our 
apologies (and our congratulations!) 


7.1 Classical Minimal Surfaces 


The following problems are part of a longer list in [68]. Let C be the space of 
connected, complete, embedded minimal surfaces and let P C C be the subspace of 
properly embedded surfaces. 


¢ Isolated Singularity Conjecture (Lawson and Gulliver): The closure of a properly 
embedded minimal surface in the punctured closed unit ball is a compact 
embedded minimal surface. 

¢ Convex Curve Conjecture (Meeks): Two convex Jordan curves in parallel planes 
cannot bound a compact minimal surface of positive genus. 

* 4z-Conjecture (Meeks, Yau, Nitsche): If I” is a simple closed curve in R? with 
total curvature at most 47, then J” bounds a unique compact, orientable, branched 
minimal surface and this unique minimal surface is an embedded disk. 

¢ Liouville Conjecture (Meeks): If M € P andh: M — Ris a positive harmonic 
function, then / is constant. 

¢ Finite Genus Properness Conjecture (Meeks, Pérez, Ros): If M € C and M has 
finite genus, then M ¢€ P. 


7.2 Integral Currents 


These problems are adapted from a longer list in [11]. 


e Establish the uniqueness of tangent cones to an mass-minimizing current. 
Uniqueness for 2-dimensional currents was proved in [90], and partial results 
in the general case in [1] and [82]. 
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* Does the singular set of a mass-minimizing current have locally finite 7”? 
measure? Chang [20] proved that it does if m = 2. 

e Is the singular set of an mass minimizing current rectifiable? Does it have other 
geometric structure such as a stratification? See e.g., Theorem 11. 


7.3 Reifenberg Problems 


Reifenberg posed ten open problems in [77]. Three of particular interest are these: 


¢ Let M be a manifold with boundary and D,; be discs with boundary 0M. Let 
it be the infimum of the areas of discs with boundary 0M. Suppose Dy — M 
and .#?(D,) > #7(M) = Uw. Prove that M is a disc. Prove the same for m- 
dimensional disks. 

¢ Generalize Theorem 2 of [77] to the case where the boundary is any manifold. 
For example, let A be the 2-torus and X the solid torus with a small interior 
ball removed. Then A is not a retract of X, as one can deformation retract X 
onto the union of A and a transverse disk. If the torus is made to be narrower in 
some region, the solution to the generalized Theorem 2 in this case would be a 
transverse disk at the narrowest location. 

e Find a class of surfaces which includes those such as the Adams example which 
retract onto their boundary, and also includes some class of deformations thereof; 
then prove a compactness theorem for such surfaces. Do those sets which do not 
admit a deformation retraction onto the boundary forms such a class? 


7.4 Elliptic integrands 


e Show by example that interesting non-smooth solutions can arise which represent 
observed phenomena in nature if an elliptic integrand is not smooth. 

¢ Prove a version of the main result in [57] for mass, instead of size, weighted by 
an elliptic density functional. 

¢ What restrictions on the competing class of surfaces can be made that carry 
over to minimizing solutions? E.g., one can restrict the problem to graphs, disks, 
continuous embeddings, bordisms, topological type, etc. Each problem presents 
its own existence and regularity questions. 

e Axiomatic approach: Let .Y be a collection of surfaces such that if S € 7% 
and @ is a Lipschitz map fixing A which is C° close to a diffeomorphism, then 
o(S) € A. What are minimal conditions needed on .” to guarantee existence of 
a minimizer for an elliptic integrand in S? See [35-37, 57]. 
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7.5 Non-closed Curves 


¢ Find models for surfaces spanning non-closed curves and prove a compactness 
theorem (Fig. 2b). In [58] we proposed using relative (co)homology as follows: If 
one replaces the boundary set A with a pair (A, B), the definition of a surface with 
coboundary can be repeated: A pair (X, Y) > (A, B) is a surface with coboundary 
> L if L is disjoint from the image of .* : H"—'(X, Y) > H”~!(A, B). To what 
extent can this be adapted if B is permitted to vary in some restricted fashion? 
See also [40] and [67, 11.3]. 


7.6 Varifolds 


* Does a smoothly embedded closed curve in R? bound a film with minimal area 
in the class of all (M, 0, 6)-minimal sets? 
¢ Does the singular set of a stationary varifold have measure zero? 


7.7 Dynamics and Deformations 


We close with three increasingly open-ended problems 


¢ Euler-Plateau for sliding boundaries: State and solve the Euler-Plateau problem 
for sliding boundaries. Not only is the bounding set B allowed to be flexible, but 
frontiers of solutions can slide around within B as it flexes. 

¢ To what extent can mean curvature flow detect soap-film solutions including 
triple junctions and non-orientable surfaces, starting with a given spanning set? 
See [18, 91, 92]. 

¢ The problem of lightning (Harrison and Pugh): Formulate a dynamic version 
of Plateau’s problem which models the formation and evolution of branched 
solutions. Applications would be numerous: lightning, formation of capillaries, 
branches, fractures, etc. One should permit boundaries with higher Hausdorff 
dimension and solve the elliptic integrand problem, as solutions would be 
branched minimizers of the corresponding action principle, and thus should be 
relevant to physics. 
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The Unknotting Problem 


Louis H. Kauffman 


Abstract This paper tells the story of knots and the search to detect their knotted- 
ness. 


1 Introduction 


We say that a closed loop embedded in three dimensional space is knotted if there 
is no continuous deformation of the loop, through embeddings, that changes it to a 
circle embedded in a plane. The fundamental problem in knot theory is to determine 
whether a closed loop embedded in three dimensional space is knotted. The loop in 
Fig. | is a trefoil knot, the simplest knot. View Fig. 2. Can you tell whether this loop 
is knotted or not? It requires a special intuition for topology to just look at a loop 
and know if it is really knotted. 

In order to analyze the knottedness of a knot, a mathematical representation is 
required. In this paper we shall use the method of knot and link diagrams, and 
the equivalence relation generated by the Reidemeister moves (see Fig.3 for an 
illustration of these moves). Knot diagrams are graphs with extra structure that 
encode the embedding type of the knot. Each diagram is a pictorial representation 
of the knot, and so appeals to the intuition of the viewer. The Reidemeister moves 
are a set of simple combinatorial moves, proved in the 1920s to capture the 
notion of topological equivalence of knots and links in three dimensional space. 
Single applications of these moves can leave the diagram with the same number 
of crossings (places where a weaving of two segments occurs), or increase or 
decrease the number of crossings. Some unknottings can be accomplished without 
increasing the number of crossings in the diagram. We call such unknot diagrams 
easy since the fact that they are unknotted can be determined by a finite search for 
simplifying moves. However, there are culprit diagrams [28, 56, 58, 60] that require 
moves that increase the number of crossings before the diagram can be simplified 


L.H. Kauffman (|) 
Department of Mathematics, Statistics, and Computer Science, 
University of Illinois at Chicago, Chicago, IL 60607-7045, USA 
e-mail: kauffman @uic.edu 


© Springer International Publishing Switzerland 2016 303 
J.F. Nash, Jr., M.Th. Rassias (eds.), Open Problems in Mathematics, 
DOI 10.1007/978-3-319-32162-2_8 


304 L.H. Kauffman 


Fig. 1 The trefoil knot 


Fig. 2. Knotted? 


to an unknotted circle with no crossings. It is the structure of such culprits that is 
the subject matter of Sect. 3 of this paper. 

This paper is an exploration of the theme of detecting knottedness. This exposi- 
tion is an outgrowth of a series of lectures [30] that I gave in Tokyo at the Knots 96 
conference in the summer of 1996. The present exposition goes considerably farther 
than those lectures, as in the intervening time there has been much progress on this 
problem. In particular there is a beautiful combinatorial solution to the problem of 
detecting whether a knot diagram is knotted, due to Dynnikov [10] that we sketch 
in Sect. 3 of this paper. 

The problem of detecting the unknot has been investigated by many people. 
The three dimensional work of Haken and Hemion [33] solves these problems in 


IT) ya 
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principle, giving definite algorithms to tell whether two knots are equivalent, or 
whether a given knot is unknotted. The algorithms are given in terms of the structure 
of a triangulation of the complement of the knot or link, and these algorithms 
are unwieldy. Nevertheless, the major problem that we have discussed is solved 
by the Haken-Hemion method. This result did not end the field of study of knot 
invariants. In fact, it spurred it on and the methods that we have discussed in 
this paper are just the tip of the iceberg of the revolutions that have engulfed the 
theory of knots and links since the early 1980s. For example, the papers by Birman 
and Hirsch [54] and Birman and Moody [55] study the unknotting problem. More 
recently it has been shown that both Khovanov Homology [17] (a generalization 
of the Jones polynomial) and Heegard Floer Homology (a generalization of the 
Alexander polynomial) detect the unknot. Heegaard Floer Homology not only 
detects the unknot, but can be used to calculate the least genus of an orientable 
spanning surface for any knot. This is an outstanding result. The reader can examine 
the paper by Manolescu et al. [62] for more information. In that work, the Heegard 
Floer homology is expressed via a chain complex that is associated to a rectangular 
diagram of just the type that Dynnikov uses. 

This paper is organized as follows. Section 2 discusses the Reidemeister moves 
and the combinatorial approach to the theory of knots and links. We sketch the 
proof of Reidemeister’s Theorem that expresses equivalence of knots and links 
in terms of diagram moves (the Reidemeister moves). We give examples of knot 
diagrams that are unknotted but must be made more complicated in order to be 
undone by a sequence of Reidemeister moves. If it were not for this phenomenon, 
the problem of detecting an unknotted diagram would be over at once. One would 
only need to try to simplify the diagram by Reidemeister moves (that is, to try to 
reduce the number of crossings in the diagram). This is a finite search and that 
would be an algorithm to determine the knottedness of the diagram. Section 2 
explains the solution by Dynnikov to this conundrum of unknotting by moves. 
We reformulate knot theory in terms of arc-diagrams and explain how Dynnikov 
constructs moves on these diagrams that can simplify an unknotted knot diagram. 
In this way, the problem of detecting the unknot is solved. There are many questions 


Fig. 3 Reidemeister moves 
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still remaining in the combinatorial domain, and the end of Sect. 3 discusses them. 
We give upper bounds on the number of crossings needed to unknot an unknotted 
diagram (work with Henrich [28] based on Dynnikov’s work) and upper bounds 
on the number of Reidemeister moves needed for unknotting. Section 4 turns to 
the question of detection of knotting by the Jones polynomial and gives examples of 
links that cannot be detected by the Jones polynomial. It is still an unsolved problem 
whether the Jones polynomial detects the unknot. Section 5 is an introduction to 
Vassiliev invariants. We include this section in order to give the reader a flavor of the 
wider context the includes the Jones polynomial with invariants that are related to 
physics and to Lie algebras. We include a description of the relationship of Vassiliev 
invariants with Witten’s approach to knot invariants via quantum field theory. We 
end this chapter with a statement of the remarkable problem to distinguish knots 
from their reverses. There are knots that are not isotopic to the curve obtained by 
reversing the orientation of the loop. At this writing it is not known if Vassiliev 
invariants can detect reversibility. 

There are many more problems about the detection of knottedness. We have not 
touched on the question of distinguishing distinct knots in this paper. It is possible 
that if two knots are not isotopic, then there are Vassiliev invariants that distinguish 
them. But that is another country. 


2 Reidemeister Moves 


Reidemeister [39] discovered a simple set of moves on link diagrams that captures 
the concept of ambient isotopy of knots in three-dimensional space. There are 
three basic Reidemeister moves. Reidemeister’s theorem states that two diagrams 
represent ambient isotopic knots (or links) if and only if there is a sequence of 
Reidemeister moves taking one diagram to the other. The Reidemeister moves are 
illustrated in Fig. 3. 

Reidemeister’s three moves are interpreted as performed on a larger diagram in 
which the small diagram shown is a literal part. Each move is performed without 
disturbing the rest of the diagram. Note that this means that each move occurs, up 
to topological deformation, just as it is shown in the diagrams in Fig. 3. There are 
no extra lines in the local diagrams. For example, the equivalence (A) in Fig. 4 is 
not an instance of a single first Reidemeister move. Taken literally, it factors into a 
move IJ followed by a move /. 

Diagrams are always subject to topological deformations in the plane that 
preserve the structure of the crossings. These deformations could be designated as 
“Move Zero”. See Fig. 4. 

A few exercises with the Reidemeister moves are in order. First of all, view the 
diagram in Fig. 5. It is unknotted and you can have a good time finding a sequence 
of Reidemeister moves that will do the trick. Diagrams of this type are produced 
by tracing a curve and always producing an undercrossing at each return crossing. 
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Fig. 4 Factorable move, Y 
move zero 


cor 
Move Zero 
Fig. 5 Standard unknot C/) 


- 


This type of knot is called a standard unknot. Of course we see clearly that a 
standard unknot is unknotted by just pulling on it, since it has the same structure 
as a coil of rope that is wound down onto a flat surface. 

Can one recognize unknots by simply looking for sequences of Reidemeister 
moves that undo them? This would be easy if it were not for the case that there are 
examples of unknots that require some moves that increase the number of crossings 
before they can be subsequently decreased. Such an culprit is illustrated in Fig. 6. 

It is generally not so easy to recognise unknots. However, here is a tip: Look 
for macro moves of the type shown in Fig. 7. In a macro move, we identify an arc 
that passes entirely under some piece of the diagram (or entirely over) and shift this 
part of the arc, keeping it under (or over) during the shift. In Fig.7, we illustrate 
a macro move on an arc that passes under a piece of the diagram that is indicated 
by arcs going into a circular region. A more general macro move is possible where 
the moving arc moves underneath one layer of diagram, and at the same time, over 
another layer of diagram. Macro moves often allow a reduction in the number of 
crossings even though the number of crossings will increase during a sequence of 
Reidemeister moves that generates the macro move. 
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Fig. 6 A culprit 2 


Fig. 7 Macro move 


As shown in Fig. 7, the macro-move includes as a special case both the second 
and the third Reidemeister moves, and it is not hard to verify that a macro move can 
be generated by a sequence of type II and type III Reidemeister moves. It is easy to 
see that the type I moves can be left to the end of any deformation. The demon of 
Fig. 6 is easily demolished by macro moves, and from the point of view of macro 
moves the diagram never gets more complicated. 

Let’s say that a knot can be reduced by a set of moves if it can be transformed by 
these moves to the unknotted circle diagram through diagrams that never have more 
crossings than the original diagram. Then we have shown that there are diagrams 
representing the unknot that cannot be reduced by the Reidemeister moves. On the 
other hand, I do not know whether unknotted diagrams can always be reduced by 
(appropriately generalized) macro moves in conjunction with the first Reidemeister 
move. If this were true it would give a combinatorial way to recognise the unknot. 
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Remark. In fact, there is a combinatorial way to recognise the unknot based on a 
diagrams and moves. In [10] I. A. Dynnikov finds just such a result, using piecewise 
linear knot diagrams with all ninety degree angles in the diagrams, and all arcs in the 
diagram either horizontal or vertical. We shall discuss Dynnikov’s work in Sect. 3 
of this paper. 


2.1 Reidemeister’s Theorem 


We now indicate how Reidemeister proved his Theorem. 

An embedding of a knot or link in three-dimensional space is said to be piecewise 
linear if it consists in a collection of straight line segments joined end to end. 
Reidemeister started with a single move in three-dimensional space for piecewise 
linear knots and links. Consider a point in the complement of the link, and an edge 
in the link such that the surface of the triangle formed by the end points of that 
edge and the new point is not pierced by any other edge in the link. Then one can 
replace the given edge on the link by the other two edges of the triangle, obtaining 
a new link that is ambient isotopic to the original link. Conversely, one can remove 
two consecutive edges in the link and replace them by a new edge that goes directly 
from initial to final points, whenever the triangle spanned by the two consecutive 
edges is not pierced by any other edge of the link. This triangle replacement 
constitutes Reidemeister’s three-dimensional move. See Fig. 8. It can be shown that 
two piecewise linear knots or links are ambient isotopic in three-dimensional space 
if and only if there is a sequence of Reidemeister triangle moves from one to the 
other. This will not be proved here. At the time when Reidemeister wrote his book, 
equivalence via three-dimensional triangle moves was taken as the definition of 
topological equivalence of links. 

It can also be shown that tame knots and links have piecewise linear represen- 
tatives in their ambient isotopy class. It is sufficient for our purposes to work with 
piecewise linear knots and links. Reidemeister’s planar moves then follow from an 
analysis of the shadows projected into the plane by Reidemeister triangle moves in 
space. Figure 9 gives a hint of this analysis. The result is a reformulation of the 
three-dimensional problems of knot theory to a combinatorial game in the plane. 

To go beyond the hint in Fig.9 to a complete proof that Reidemeister’s 
planar moves suffice involves preliminary remarks about subdivision. The simplest 
subdivision that one wants to be able to perform on a piecewise linear link is the 


Fig. 8 Triangle move 
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Fig. 9 Shadows 


placement of a new vertex at an interior point of an edge—so that edge becomes two 
edges in the subdivided link. Figure 10 shows how to accomplish this subdivision 
via triangle moves. 

Any triangle move can be factored into a sequence of smaller triangle moves 
corresponding to a simplicial subdivision of that triangle. This is obvious, since the 
triangles in the subdivision of the large triangle that is unpierced by the link are 
themselves unpierced by the link. 

To understand how the Reidemeister triangle move behaves on diagrams it is 
sufficient to consider a projection of the link in which the triangle is projected to a 
non-singular triangle in the plane. Of course, there may be many arcs of the link also 
projected upon the interior of the projected triangle. However, by using subdivision, 
we can assume that the cases of the extra arcs are as shown in Fig. 11. In Fig. 11 
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Fig. 10 Subdivision of an 
edge 


Fig. 11 Projections of 
triangle moves 
— 


we have also shown how each of these cases can be accomplished by (combinations 
of) the three Reidemeister moves. This proves that a projection of a single triangle 
move can be accomplished by a sequence of Reidemeister diagram moves. 

A piecewise linear isotopy consists in a finite sequence of triangle moves. There 
exists a direction in three-dimensional space that makes a non-zero angle with each 
of theses triangles and is in general position with the link diagram. Projecting to the 
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plane along this direction makes it possible to perform the entire ambient isotopy 
in the language of projected triangle moves. Now apply the results of the previous 
paragraph and we conclude 


Reidemeister’s Theorem If two links are piecewise linearly equivalent (ambient 
isotopic), then there is a sequence of Reidemeister diagram moves taking a 
projection of one link to a projection of the other. 

Note that the proof tells us that the two diagrams can be obtained from one spatial 
projection direction for the entire spatial isotopy. It is obvious that diagrams related 
by Reidemeister moves represent ambient isotopic links. Reidemeister’s Theorem 
gives a complete combinatorial description of the topology of knots and links in 
three-dimensional space. 


3 Dynnikov’s Solution of the Problem of Knot Detection 


We now discuss a powerful result proven by Dynnikov in [10]. Dynnikov uses, a 
diagram called an arc-presentation for the knot. We define this below, and show 
how one can detect the unknot using moves that preserve this type of presentation. 
This section is based on our paper [28]. 


Definition 1. An arc—presentation of a knot is a knot diagram comprised of 
horizontal and vertical line segments such that at each crossing in the diagram, the 
horizontal arc passes under the vertical arc. Furthermore, we require that no two 
edges in an arc—diagram are colinear. Two arc—presentations are combinatorially 
equivalent if they are isotopic in the plane via an ambient isotopy of the form 
h(x,y) = (f(@), g(y)). The complexity c(L) of an arc—presentation is the number 
of vertical arcs in the diagram. 


We say more generally that a link diagram is rectangular if it has only vertical 
and horizontal edges. In Fig. 12 we give an example of a rectangular diagram that 
is an arc—presentation and another example of a rectangular diagram that is not an 
arc—presentation. 

Note that a rectangular diagram can naturally be drawn on a rectangular grid. If 
we start with such a grid and represent rectangular diagrams on the grid we have 
called these knots mosaic knots and used them to define a notion of quantum knot. 
See [61] for more about quantum knots. For now, we focus our attention on arc— 
presentations. 


Proposition 1 (Dynnikov). Every knot has an arc-—presentation. Any two arc- 
presentations of the same knot can be related to each other by a finite sequence 
of elementary moves, pictured in Figs. 13 and 14. 


The proof of this proposition is elementary, based on the Reidemeister moves. 
One shows that each Reidemeister move can be represented by (a sequence of) 
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a) 


Fig. 12 The picture on the /eft is an example of an arc—presentation of a trefoil. The picture on the 
right is an example that is not an arc—presentation (since not all horizontal arcs pass under vertical 
arcs) 


Fig. 13. Elementary 


(de)stabilization moves. 
Stabilization moves increase kon Lany 
the complexity of the 
arc—presentation while 
destabilization moves 
— 


decrease the complexity 
——  — 
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Fig. 14 Some examples of exchange moves. Other allowed exchange moves include switching the 
heights of two horizontal arcs that lie in distinct halves of the diagram 


elementary moves. See [10]. We will show how to convert a usual knot diagram to 
an arc—presentation in the next few paragraphs, making use of the concept of Morse 
diagrams of knots. 


Definition. A knot diagram is in Morse form if it has 


. no horizontal lines, 

. no inflection points, 

. asingle singularity at each height, and 

. each crossing is oriented to create a 45 degree angle with the vertical axis. 
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Fig. 15 A Morse diagram of 
a knot and a corresponding 
rectangular diagram 


Conversion of 

a Morse diagram 
to a rectangular 
diagram. 


Fig. 16 Rotating a crossing 

to convert a rectangular | 

diagram into an — —»> Lp 
arc—presentation | 


Fig. 17 Converting a 

rectangular diagram into an 

arc—presentation by rotating a 

crossing | | | 


We note that converting an arbitrary knot diagram into a diagram in Morse 
form requires no Reidemeister moves, only ambient isotopies of the plane. More 
information about Morse diagrams can be found in [59]. See Figs. 15, 16 and 17 for 
an illustration of the process of conversion of a knot diagram to an arc diagram. 

In Fig. 18 we show the example found by Goeritz [56] in 1934 of a knot diagram 
that is unknotted, but requires Reidemeister moves that create more crossings before 
it can be simplified. In Fig. 19 we show another example of this same type. We shall 
refer to the latter as the “culprit” and analyse it below. 

Here is Dynnikov’s solution to the problem of recognizing the unknot. 


Theorem 1 (Dynnikov). /f L is an arc—presentation of the unknot, then there exists 
a finite sequence of exchange and destabilization moves 


Lol-ly-:::>Ln 


such than Ly, is trivial. 
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Fig. 18 Goeritz unknot 


Fig. 19 The culprit 


What is particularly interesting about this result is that the unknot can be 
simplified without increasing the complexity of the arc—presentation, that is, 
without the use of stabilization moves. This means that a finite search will reveal 
a diagram to be unknotted if that is the case. Furthermore, if we apply Dynnikov’s 
method to a knotted knot, it will stop on a diagram that is not a planar circle. Thus 
Dynnikov’s diagrammatic method can detect the unknot. 

We can go further and ask how large a diagram is needed to unknot a knot by 
Reidemeister moves, and how many Reidemeister moves are needed. 


Theorem 2 ([28]). Suppose K is a diagram (in Morse form) of the unknot with 
crossing number cr(K) and number of maxima b(K). Then, for every i, the crossing 
number cr(K;) is no more than (M — 2)* where M = 2b(K) 4+ cr(K) and K = 
Ko, Ki, K2,..., Kn is a sequence of knot diagrams such that K+, is obtained from 
K; by a single Reidemeister move and Ky is a trivial diagram of the unknot. 


To find our upper bound on the number of Reidemeister moves, we must first 
specify an upper bound on the number m of exchange and destabilization moves 
required to trivialize an arc—presentation. This bound will depend on the complexity 
c(L) = n of the arc—diagram L. We also must provide an upper bound on the number 
of Reidemeister moves required for a destabilization or exchange move. 

In [10], Dynnikov provides the following bounds on the number of combinatori- 
ally distinct arc—presentations of complexity n. 


Proposition 2. Let N(n) denote the number of combinatorially distinct arc- 
presentations of complexity n. Then the following inequality holds. 


N(n) < 5alln— D!P 
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Using this count on the number of distinct arc—presentations of a given size, we 
can find a bound (albeit a large one) on the number of arc—presentation moves we 
need. This is simply by virtue of the fact that any reasonable sequence of moves will 
contain mutually distinct arc—presentations that don’t exceed the complexity of the 
original, and there are a limited number of such diagrams. With this we obtain 


Theorem 3 ([28]). Suppose K is a diagram (in Morse form) of the unknot with 
crossing number cr(K) and number of maxima b(K). Let M = 2b(K) + cr(K). Then 
the number of Reidemeister moves required to unknot K is less than or equal to 


M 


2S sill = tf (f= 2). 


i=2 


We have provided several upper bounds regarding the complexity of the Reide- 
meister sequence required to simplify an unknot. The bound that Dynnikov’s work 
helps us obtain for the number of Reidemeister moves required to unknot an unknot 
is superexponential. Using a different technique, Hass and Lagarias were able to 
find a bound that is exponential in the crossing number of the diagram [57]. They 
prove the following result. 


Theorem 4 (Hass and Lagarias). There is a positive constant c,, such that for 

each n = 1, any unknotted knot diagram JY with n crossings can be transformed to 

the trivial knot diagram using at most 2°!" Reidemeister moves. In fact one can take 
Wl 

qQ= 10". 


Hass and Lagarias use the same technique to find an exponential bound for 
the number of crossings required for unknotting. For bounds of this second sort, 
the one presented here is a comparatively sharper estimate. Our bound on the 
number of Reidemeister moves required to unknot an unknot eventually becomes 
larger than the previously known bound from the above Theorem, but it does 
remain significantly smaller for knots with up to 1010" crossings. We can see 
this by assuming that the number of maxima in a Morse diagram of a knot is 
approximately the same size as the crossing number (in practice the number of 
maxima is significantly smaller than the crossing number). We can estimate the 
size of our bound by computing the quantity 


M-— 2 uly? 
5 ')°. 


This larger estimate remains smaller than the bound proposed by Hass and 
Lagarias until the knots have 1910" crossings. It is possible that these methods 
may be improved to find smaller unknotting bounds for unknot diagrams. In fact, 
polynomial bounds are now shown by Lackenby [34]. 

Let us return to the Culprit. Recall that hard unknots are difficult to unknot by 
virtue of the fact that no simplifying type I or type II Reidemeister moves and no 
type III moves are available. In Fig. 20, we picture a Morse diagram of the Culprit, a 


NIA 


Fig. 20 Undoing the culprit by Reidemeister moves 


corresponding arc-presentation and a sequence of Dynnikov moves that simplify 
it to a standard unknot. Bounds for the size of diagrams that are needed using 
Reidemeister’s moves can be deduced using Dynnikov’s work. See [10, 28]. In fact, 
we [28] derive a quadratic upper bound on the crossing number of diagrams in 
an unknotting sequence. A similar result can be found in [10]. We saw that the 
Culprit may be unknotted with ten Reidemeister moves in Fig. 21 (see also [60]). 
The maximum crossing number of all diagrams in the given Reidemeister sequence 
is 12, two more than the number of crossings in the Culprit. On the other hand, we 
can compute our upper bound on the number of crossings required for unknotting 
as follows. Since the crossing number cr(K) = 10 and the number of maxima in the 
diagram is b(K) = 5, we see that M = cr(K) + 2b(K) = 20. Thus, our bound is 
(M — 2) = 18% = 324. There is room for improvement! 

We can also use M to find our bound for the number of Reidemeister moves 
required to unknot the Culprit. 


M 1 20 
> 5ilG- 1)IP(M -2) =9)i[@- DIP. 
i=2 i=2 


The largest term in this expression is roughly 10*°, unfortunately quite a bit larger 
than ten. 

In this section we have sketched results given in more detail in [10, 28], showing 
Dynnikov’s remarkable solution to the unknotting problem. Remarkable as this 
solution is, we are not happy since we believe that better bounds on the number of 
Reidemeister moves needed to unknot a knot are surely possible, and better bounds 
on the complexity of diagrams is also possible. Could there be a simple algorithm 
in the form of a calculation from a diagram that would tell if a knot was knotted? 
This is the subject of the next section where we discuss the Jones polynomial. 
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Fig. 21 Undoing the culprit by Dynnikov moves 


4 The Bracket Polynomial and the Jones Polynomial 


Now that we have exhibited a solution to problem of the detection of the unknot, 
we turn to some unsolved problems related to other methods of detection. It is 
an open problem whether there exist classical knots (single component loops) that 
are knotted and yet have unit Jones polynomial [14]. In other words, it is an open 
problem whether the Jones polynomial can detect all knots. There do exist families 
of links whose linkedness is undetectable by the Jones polynomial [47, 48]. It is the 
purpose of this section of the paper to give a summary of some of the information 
that is known in this arena. We begin with a sketch of ways to calculate the bracket 
polynomial model of the Jones polynomial, and then discuss how to construct 
classical links that are undetectable by the Jones polynomial. 

The bracket polynomial [20—23, 25] model for the Jones polynomial [14—16, 52] 
is described by the expansion 


ae = +A) G (1) 

and we have 
(KO) = (-A? — A) (K) (2) 
(S} = CAYO) 3) 


(G) = (ABW) (4) 
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A state S of a link diagram K is obtained by choosing a smoothing for each 
crossing in the diagram and labelling that smoothing with either A or A~! according 
to the convention indicated in the bracket expansion above. Then, given a state S, 
one has the evaluation < K|S > equal to the product of the labels at the smoothings, 
and one has the evaluation ||S|| equal to the number of loops in the state. One then 
has the formula 


<K>= 33 <kK|S> dsr 


where the summation runs over the states S of the diagram K, and d = —A? —A-~?, 
This state summation is invariant under all classical and virtual moves except the 
first Reidemeister move. The bracket polynomial is normalized to an invariant fx (A) 
of all the moves by the formula f(A) = (—A*)~"") < K > where w(K) is the 
writhe of the (now) oriented diagram K. The writhe is the sum of the orientation 
signs (+1) of the crossings of the diagram. The Jones polynomial, V«(t) is given in 
terms of this model by the formula 


V(t) = f(t"). 


The state sum is part of a wider approach to invariants of knots and links that 
we do not concentrate upon in this paper. First of all, the Alexander polynomial 
[53] was the first polynomial invariant of knots and links. It was not until 1981 
that the Alexander polynomial was reformulated as a state summation [18-20]. In 
1983 Jones discovered his polynomial and showed that it satisfied a skein relation 
similar to that for the Alexander-Conway polynomial [9]. Along with this there 
arose relations with statistical mechanics [5] in the work of Jones. The bracket 
model was the first direct relationship of the Jones polynomial with statistical 
mechanics. In the wake of the discovery of the Jones polynomial came more 
skein polynomials, particularly the Homflypt polynomial [20, 35] named after the 
people who discovered it—Hoste, Ocneanu, Freyd, Lickorish, Yetter, Przytycki and 
Trawcezk, and the Kauffman Polynomial [24]. These invariants are more powerful 
than the Jones polynomial, but it is still conjectural that they detect the unknot. After 
the skein polynomials, came more algebraic state sums based on the work of Yang 
and Baxter in statistical mechanics, and then arose relationships with Lie algebras 
and gauge theoretic physics. We shall sketch some of these developments in the 
sections to follow. It remains to be seen how powerful all these new invariants are as 
detectors of knottedness, but it is known that certain families of these invariants do 
detect knottedness and those results are found in the relationships of the knot theory 
with physics 

It remains on open problem whether the Jones polynomial can detect the unknot. 
We can make the conjecture as follows: 


Knot Detection Conjecture If K is a knot diagram of one component and 
V(t) = 1, then K is equivalent by Reidemeister moves to the unknot. 
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Fig. 22 Thistethwaite’s link 


This knot detection conjecture is false for links. View Fig.22. Here we have a 
version of a link L discovered by Thistlethwaite [47] in December 2000. One can 
verify that this link is indeed non-trivial, but it has the same Jones polynomial as 
the unlink of two circles. In [48] we produce infinite families of distinct links that 
appear to be unlinked to the Jones polynomial. 


4.1 Present Status of Links Not Detectable 
by the Jones Polynomial 


In this section we give a quick review of the status of our work 

A tangle (2-tangle) consists in an embedding of two arcs in a three-ball (and 
possibly some circles embedded in the interior of the three-ball) such that the 
endpoints of the arcs are on the boundary of the three-ball. One usually depicts the 
arcs as crossing the boundary transversely so that the tangle is seen as the embedding 
in the three-ball augmented by four segments emanating from the ball, each from 
the intersection of the arcs with the boundary. These four segments are the exterior 
edges of the tangle, and are used for operations that form new tangles and new 
knots and links from given tangles. Two tangles in a given three-ball are said to be 
topologically equivalent if there is an ambient isotopy from one to the other in the 
given three-ball, fixing the intersections of the tangles with the boundary. 

It is customary to illustrate tangles with a diagram that consists in a box (within 
which are the arcs of the tangle) and with the exterior edges emanating from the 
box in the NorthWest (NW), NorthEast (NE), SouthWest (SW) and SouthEast (SE) 
directions. Given tangles T and S, one defines the sum, denoted T + S by placing 
the diagram for S to the right of the diagram for T and attaching the NE edge of 
T to the NW edge of S, and the SE edge of T to the SW edge of S. The resulting 
tangle T + S has exterior edges corresponding to the NW and SW edges of T and 
the NE and SE edges of S. There are two ways to create links associated to a tangle 
T. The numerator T is obtained, by attaching the (top) NW and NE edges of T 
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together and attaching the (bottom) SW and SE edges together. The denominator T? 
is obtained, by attaching the (left side) NW and SW edges together and attaching 
the (right side) NE and SE edges together. We denote by [0] the tangle with only 
unknotted arcs (no embedded circles) with one arc connecting, within the three- 
ball, the (top points) NW intersection point with the NE intersection point, and 
the other arc connecting the (bottom points) SW intersection point with the SE 
intersection point. A ninety degree turn of the tangle [0] produces the tangle [co] 
with connections between NW and SW and between NE and SE. One then can 
prove the basic formula for any tangle T 


<T >= ar < [0] > +r < [oo] > 


where a7 and fr are well-defined polynomial invariants (of regular isotopy) of the 
tangle T. From this formula one can deduce that 


< TN >= ard + Br 


and 
<T? >= ar + Brd. 


We define the bracket vector of T to be the ordered pair (a7, 6) and denote it 
by br(T), viewing it as a column vector so that br(T)! = (a7, Br) where v' denotes 
the transpose of the vector v. With this notation the two formulas above for the 
evaluation for numerator and denominator of a tangle become the single matrix 


equation 
4.1" dl 
= br(T). 
bed | ne) 


We then use this formalism to express the bracket polynomial for our examples. 
The class of examples that we considered are each denoted by H(T,U) where T 
and U are each tangles and H(T, U) is a satellite of the Hopf link that conforms 
to the pattern shown in Fig. 23, formed by clasping together the numerators of 
the tangles T and U. Our method is based on a transformation H(T,U) —> 
H(T,U)®, whereby the tangles T and U are cut out and reglued by certain specific 
homeomorphisms of the tangle boundaries. This transformation can be specified 
by a modification described by a specific rational tangle and its mirror image. 
Like mutation, the transformation w preserves the bracket polynomial. However, 
it is more effective than mutation in generating examples, as a trivial link can be 
transformed to a prime link, and repeated application yields an infinite sequence of 
inequivalent links. 

Specifically, the transformation H(T, U)® is given by the formula 


H(T,U)® = H(T°,U°) 
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Fig. 23 Hopf link satellite 
A(T, U) 


Fig. 24 The omega 
operations 


where the tangle operations T° and U®) are as shown in Fig.24. By direct 
calculation, there is a matrix M such that 


< H(T,U) >= br(T)'Mbr(U) 
and there is a matrix §2 such that 
br(T°) = Qbr(T) 
and 
br(T®) = Q7'br(T). 
One verifies the identity 


Q'MQ7!=M 
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Fig. 25 Applying omega 
operations to an unlink VW 
= x 
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from which it follows that < H(T,U) >®=< H(T,U) > . This completes the 
sketch of our method for obtaining links that whose linking cannot be seen by 
the Jones polynomial. Note that the link constructed as H(T®, U®) in Fig. 25 has 
the same Jones polynomial as an unlink of two components. This shows how the 
first example found by Thistlethwaite fits into our construction. 


5 Vassiliev Invariants and Invariants of Rigid Vertex Graphs 


If V(K) is a (Laurent polynomial valued, or more generally—commutative ring 
valued) invariant of knots, then it can be naturally extended to an invariant of rigid 
vertex graphs by defining the invariant of graphs in terms of the knot invariant 
via an “unfolding” of the vertex. That is, we can regard the vertex as a “black 
box” and replace it by any tangle of our choice. Rigid vertex motions of the graph 
preserve the contents of the black box, and hence entail ambient isotopies of the link 
obtained by replacing the black box by its contents. Invariants of knots and links that 
are evaluated on these replacements are then automatically rigid vertex invariants 
of the corresponding graphs. If we set up a collection of multiple replacements 
at the vertices with standard conventions for the insertions of the tangles, then a 
summation over all possible replacements can lead to a graph invariant with new 
coefficients corresponding to the different replacements. In this way each invariant 
of knots and links implicates a large collection of graph invariants. See [25, 26]. 

The simplest tangle replacements for a 4-valent vertex are the two crossings, 
positive and negative, and the oriented smoothing. Let V(K) be any invariant of 
knots and links. Extend V to the category of rigid vertex embeddings of 4-valent 
graphs by the formula (see Fig. 26) 


V(Kx) = aV(Ky) + bV(K_) + cV(Ko) 


324 L.H. Kauffman 


Fig. 26 Graphical vertex V =av + bV +cV 
formulas 


V = WP Q a4 Y 
D OD OD D 
Here K,, indicates an embedding with a transversal 4-valent vertex. This formula 
means that we define V(G) for an embedded 4-valent graph G by taking the sum 


VG= > grOy Comy(s) 
S 


with the summation over all knots and links S$ obtained from G by replacing a 
node of G with either a crossing of positive or negative type, or with a smoothing 
(denoted 0). Here i+ (S) denotes the number of positive crossings in the replacement, 
i_(S) the number of negative crossings in the replacement, and ig(S) the number of 
smoothings in the replacement. It is not hard to see that if V(K) is an ambient isotopy 
invariant of knots, then, this extension is a rigid vertex isotopy invariant of graphs. 
In rigid vertex isotopy the cyclic order at the vertex is preserved, so that the vertex 
behaves like a rigid disk with flexible strings attached to it at specific points. See the 
previous section. 

There is a rich class of graph invariants that can be studied in this manner. The 
Vassiliev Invariants [4, 6, 50] constitute the important special case of these graph 
invariants where a = +1, b = —1 and c = 0. Thus V(G) is a Vassiliev invariant if 


V(Kx) = V(Ky) — VK). 


Call this formula the exchange identity for the Vassiliev invariant V. V is said to be 
of finite type k if V(G) = 0 whenever |G| > & where |G| denotes the number of 
4-valent nodes in the graph G. The notion of finite type is of paramount significance 
in studying these invariants. One reason for this is the following basic Lemma. 


Lemma. [fa graph G has exactly k nodes, then the value of a Vassiliev invariant v, 
of type k on G, v;(G), is independent of the embedding of G. 


The Unknotting Problem 325 


Proof. The different embeddings of G can be represented by link diagrams with 
some of the 4-valent vertices in the diagram corresponding to the nodes of G. 
It suffices to show that the value of vxz(G) is unchanged under switching of a 
crossing. However, the exchange identity for vz, shows that this difference is equal 
to the evaluation of v, on a graph with k + | nodes and hence is equal to zero. This 
completes the proof. 


The upshot of this Lemma is that Vassiliev invariants of type k are intimately 
involved with certain abstract evaluations of graphs with k nodes. In fact, there 
are restrictions (the four-term relations) on these evaluations demanded by the 
topology (we shall articulate these restrictions shortly) and it follows from results of 
Kontsevich [4] that such abstract evaluations actually determine the invariants. The 
invariants derived from classical Lie algebras are all built from Vassiliev invariants 
of finite type. All this is directly related to Witten’s functional integral [52]. 


Definition. Let v; be a Vassiliev invariant of type k. The top row of v, is the set 
of values that vg assigns to the set of (abstract) 4-valent graphs with k nodes. If we 
concentrate on Vassiliev invariants of knots, then these graphs are all obtained by 
marking 2k points on a circle, and choosing a pairing of the 2k points. The pairing 
can be indicated by drawing a circle and connecting the paired points with arcs. 
Such a diagram is called a chord diagram. Some examples are indicated in Fig. 27. 


Note that a top row diagram cannot contain any isolated pairings since this would 
correspond to a difference of local curls on the corresponding knot diagram (and 
these curls, being isotopic, yield the same Vassiliev invariants. 


The Four-Term Relation (Compare [45].) Consider a single embedded graphical 
node in relation to another embedded arc, as illustrated in Fig. 28. The arc underlies 
the lines incident to the node at four points and can be slid out and isotoped over the 
top so that it overlies the four nodes. One can also switch the crossings one-by-one 


Fig. 27 Chord diagrams 
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Fig. 28 The four term 
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to exchange the arc until it overlies the node. Each of these four switchings gives 
rise to an equation, and the left-hand sides of these equations will add up to zero, 
producing a relation corresponding to the right-hand sides. Each term in the right- 
hand side refers to the value of the Vassiliev invariant on a graph with two nodes 
that are neighbors to each other. 

There is a corresponding 4-term relation for chord diagrams. This is the 4-term 
relation for the top row. In chord diagrams the relation takes the form shown at the 
bottom of Fig. 28. Here we have illustrated only those parts of the chord diagram 
that are relevant to the two nodes in question (indicated by two pairs of points on 
the circle of the chord diagram). The form of the relation shows the points on the 
chord diagram that are immediate neighbors. These are actually neighbors on any 
chord diagram that realizes this form. Otherwise there can be many other pairings 
present in the situation. 

As an example, consider the possible chord diagrams for a Vassiliev invariant 
of type 3. There are two possible diagrams as shown in Fig. 29. One of these has 
the projected pattern of the trefoil knot and we shall call it the trefoil graph. These 
diagrams satisfy the 4-term relation. This shows that one diagram must have twice 
the evaluation of the other. Hence it suffices to know the evaluation of one of these 
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Fig. 29 The four term 
relation for a type three 
invariant 7 


Fig. 30 Trefoil graph 


two diagrams to know the top row of a Vassiliev invariant of type 3. We can take 
this generator to be the trefoil graph 

Now one more exercise: Consider any Vassiliev invariant v and let’s determine 
its value on the trefoil graph as in Fig. 30. 

The value of this invariant on the trefoil graph is equal to the difference between 
its values on the trefoil knot and its mirror image. Therefore any Vassiliev invariant 
that assigns a non-zero value to the trefoil graph can tell the difference between the 
trefoil knot and its mirror image. 
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Example. This example shows how the original Jones polynomial is composed of 
Vassiliev invariants of finite type. Let V(t) denote the original Jones polynomial 
[14]. Recall the oriented state expansion for the Jones polynomial [27] with the 
basic formulas (6 is the loop value.) 


Vey = 1? Vx, _ tVKoo 
Vex_= —t Vx, = t | VK... 


6= a(p? + i), 
Let t = e*. Then 


Vy = = Pe. = eVKo5 
Ve_ = =e Ay. = e VK: 


5 = ar ats oo). 
Thus 
Vx, = Vex — Ve_ = —2sinh(x/2)Vx, — 2sinh(x) VK... 


Thus x divides Vx,, and therefore x* divides Vg whenever G is a graph with at 
least k nodes. Letting 


Vale") = Yo (Gx, 


k=0 


we see that this condition implies that v,(G) vanishes whenever G has more than 
k nodes. Hence the coefficients of the powers of x in the expansion of Vx(e*) are 
Vassiliev invariants of finite type! This result was first observed by Birman and Lin 
[6] by a different argument. 


Let’s look a little deeper and see the structure of the top row for the Vassiliev 
invariants related to the Jones polynomial. By our previous remarks the top row 
evaluations correspond to the leading terms in the power series expansion. Since 


§ = —(e/? + e/?) = —2 + [higher], 
—e*/? 4 e*/? — —x + [higher], 
—e'+e~* = —2x + [higher], 


it follows that the top rows for the Jones polynomial are computed by the recursion 
formulas 
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(Kx) = —v(Ko) — 2V(Koo) 
v([loop]) = —2. 


The reader can easily check that this recursion formula for the top rows of 
the Jones polynomial implies that v3 takes the value 24 on the trefoil graph and 
hence it is the Vassiliev invariant of type 3 in the Jones polynomial that first detects 
the difference between the trefoil knot and its mirror image. 

This example gives a good picture of the general phenomenon of how the 
Vassiliev invariants become building blocks for other invariants. In the case of 
the Jones polynomial, we already know how to construct the invariant and so it 
is possible to get a lot of information about these particular Vassiliev invariants 
by looking directly at the Jones polynomial. This, in turn, gives insight into the 
structure of the Jones polynomial itself. 


5.1 Lie Algebra Weights 


Consider the diagrammatic relation shown in Fig. 31. Call it (after Bar-Natan [4]) 
the STU relation. 


Lemma. STU implies the 4-term relation. 
Proof. View Fig. 32. 


STU is the smile of the Cheshire cat. That smile generalizes the idea of a Lie 
algebra. Take a (matrix) Lie algebra with generators T’. Then 


TT? =: TT! = ifabeT 
expresses the closure of the Lie algebra under commutators. Translate this equation 


into diagrams as shown in Fig. 33, and see that this translation is STU with Lie 
algebraic clothing! 


Fig. 31 The STU relation 
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Fig. 32. A diagrammatic 
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Fig. 33 Algebraic clothing 


Here the structure tensor of the Lie algebra has been assumed (for simplicity) 
to be invariant under cyclic permutation of the indices. This invariance means that 
our last Lemma applies to this Lie algebraic interpretation of STU. The upshot is 
that we can manufacture weight systems for graphs that satisfy the 4-term relation 
by replacing paired points on the chord diagram by an insertion of T“ in one point 
of the pair and a corresponding insertion of T“ at the other point in the pair and 
summing over all a. The result of all such insertions on a given chord diagram is a 
big sum of specific matrix products along the circle of the diagram, each of which 
(being a circular product) is interpreted as a trace. 

Let’s say this last matter more precisely: Regard a graph with k nodes as obtained 
by identifying k pairs of points on a circle. Thus a code such as 1212 taken in cyclic 
order specifies such a graph by regarding the points 1,2,1,2 as arrayed along a 
circle with the first and second 1’s and 2’s identified to form the graph. Define,for a 
code a) d2... dm 


wt(a|az...m) = trace(T T?T" ...T%”) 


where the Einstein summation convention is in place for the double appearances of 
indices on the right-hand side. This gives the weight system. 
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Fig. 34 Weight system and 
Casimir insertion 
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The weight system described by the above procedure satisfies the 4-term relation, 
but does not necessarily satisfy the vanishing condition for isolated pairings. This 
is because the framing compensation for converting an invariant of regular isotopy 
to ambient isotopy has not yet been introduced. We will show how to do this in the 
course of the discussion in the next paragraph. The main point to make here is that 
by starting with the idea of extending an invariant of knots to a Vassiliev invariant of 
embedded graphs and searching out the conditions on graph evaluation demanded 
by the topology, we have inevitably entered the domain of relations between Lie 
algebras and link invariants. Since the STU relation does not demand Lie algebras 
for its satisfaction we see that the landscape is wider than the Lie algebra context, 
but it is not yet understood how big is the class of link invariants derived from Lie 
algebras. 

In fact, we can line up this weight system with the formalism related to the knot 
diagram by writing the Lie algebra insertions back on the 4-valent graph. We then 
get a Casimir insertion at the node. See Fig. 34. 

To get the framing compensation, note that an isolated pairing corresponds to the 
trace of the Casimir. Let y denote this trace. See Fig. 34. 


y =tr()_\ TT") 


Let D be the trace of the identity. Then it is easy to see that we must compensate 
the given weight system by subtracting (y/D) multiplied by the result of dropping 
the identification of the two given points. We can diagram this by drawing two 
crossed arcs without a node drawn to bind them. Then the modified recursion 
formula becomes as shown in Fig. 35. 
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Fig. 35 Modified recursion 
formula 
= a 
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(Fierz Identity) 
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For example, in the case of SU(N) we have D = N, y = (N? — 1)/2 so that we 
get the transformation shown in Fig. 35, including the use of the Fierz identity. 

For N = 2 the final formula of Fig. 35 is,up to a multiple, exactly the top row 
formula that we deduced for the Jones polynomial from its combinatorial structure. 


5.2 Witten’s Functional Integral and Vassiliev Invariants 


In [52] Edward Witten proposed a formulation of a class of 3-manifold invariants as 
generalized Feynman integrals taking the form Z(M) where 


Z(M) = | eesplcik/4x)s(.A)) 


Here M denotes a 3-manifold without boundary and A is a gauge field (also called 
a gauge potential or gauge connection) defined on M. The gauge field is a one-form 
on a trivial G-bundle over M with values in a representation of the Lie algebra of 
G. The group G corresponding to this Lie algebra is said to be the gauge group. 
In this integral the “action” S(M, A) is taken to be the integral over M of the trace of 
the Chern-Simons three-form CS = AdA + (2/3)AAA. (The product is the wedge 
product of differential forms.) 
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Z(M) integrates over all gauge fields modulo gauge equivalence (see [2] for a 
discussion of the definition and meaning of gauge equivalence.) 

The formalism and internal logic of Witten’s integral supports the existence of 
a large class of topological invariants of 3-manifolds and associated invariants of 
knots and links in these manifolds. 

The invariants associated with this integral have been given rigorous combinato- 
rial descriptions [31, 35, 41, 49, 51], but questions and conjectures arising from the 
integral formulation are still outstanding (see for example [3, 11—13, 40]). Specific 
conjectures about this integral take the form of just how it involves invariants of links 
and 3-manifolds, and how these invariants behave in certain limits of the coupling 
constant k in the integral. Many conjectures of this sort can be verified through 
the combinatorial models. On the other hand, the really outstanding conjecture 
about the integral is that it exists! At the present time there is no measure theory 
or generalization of measure theory that supports it. It is a fascinating exercise to 
take the speculation seriously, suppose that it does really work like an integral and 
explore the formal consequences. Here is a formal structure of great beauty. It is 
also a structure whose consequences can be verified by a remarkable variety of 
alternative means. Perhaps in the course of the exploration there will appear a hint 
of the true nature of this form of integration. 

We now look at the formalism of the Witten integral in more detail and see how 
it involves invariants of knots and links corresponding to each classical Lie algebra. 
In order to accomplish this task, we need to introduce the Wilson loop. The Wilson 
loop is an exponentiated version of integrating the gauge field along a loop K in 
three-space that we take to be an embedding (knot) or a curve with transversal self- 
intersections. For this discussion, the Wilson loop will be denoted by the notation 
Wx(A) =< K|A > to denote the dependence on the loop K and the field A. It is 
usually indicated by the symbolism tr(Pexp( J; xA)) . Thus 


Wx(A) =< K|A >= t(Pexpt A)). 
K 


Here the P denotes path ordered integration—we are integrating and exponentiating 
matrix valued functions, and so must keep track of the order of the operations. The 
symbol tr denotes the trace of the resulting matrix. 

With the help of the Wilson loop functional on knots and links, Witten writes 
down a functional integral for link invariants in a 3-manifold M: 


Z(M, K) = [ esesptcit/4xys.ayyr(Pesn( fay) 
= [ eerptiin/4ns <K|A>. 


Here S(M, A) is the Chern-Simons Lagrangian, as in the previous discussion. 
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We abbreviate S(M,A) as S and write < K|A > for the Wilson loop. Unless 
otherwise mentioned, the manifold M will be the three-dimensional sphere S* 

An analysis of the formalism of this functional integral reveals quite a bit about 
its role in knot theory. This analysis depends upon key facts relating the curvature 
of the gauge field to both the Wilson loop and the Chern-Simons Lagrangian. The 
idea for using the curvature in this way is due to Smolin [43, 44] (see also [38]). To 
this end, let us recall the local coordinate structure of the gauge field A(x), where x 
is a point in three-space. We can write A(x) = A‘(x)T“dx, where the index a ranges 
from 1 to m with the Lie algebra basis {7!, 7”, T?,..., T’"}. The index k goes from 
1 to 3. For each choice of a and k, Ar (x) is a smooth function defined on three-space. 
In A(x) we sum over the values of repeated indices. The Lie algebra generators T“ 
are matrices corresponding to a given representation of the Lie algebra of the gauge 
group G. We assume some properties of these matrices as follows: 


1. [T¢,T°] = ifaycT° where [x,y] = xy — yx , and fy. (the matrix of structure 
constants) is totally antisymmetric. There is summation over repeated indices. 

2. tr(T¢T’) = 5% /2 where 5“ is the Kronecker delta (6 = 1 if a = b and zero 
otherwise). 


We also assume some facts about curvature. (The reader may enjoy comparing 
with the exposition in [27]. But note the difference of conventions on the use of i in 
the Wilson loops and curvature definitions.) The first fact is the relation of Wilson 
loops and curvature for small loops: 


Fact 1. The result of evaluating a Wilson loop about a very small planar circle 
around a point x is proportional to the area enclosed by this circle times the 
corresponding value of the curvature tensor of the gauge field evaluated at x. 
The curvature tensor is written 


Fi (x)T“dx,dys. 


It is the local coordinate expression of AdA + AA. 


Application of Fact 1 Consider a given Wilson line < K|S >. Ask how its value 
will change if it is deformed infinitesimally in the neighborhood of a point x on 
the line. Approximate the change according to Fact 1, and regard the point x as the 
place of curvature evaluation. Let 6 < K|A > denote the change in the value of the 
line. 6 < K|A > is given by the formula 


6 < K|A >= dx,dx,FO(x)T* < K|A>. 


This is the first order approximation to the change in the Wilson line. 

In this formula it is understood that the Lie algebra matrices T“ are to be inserted 
into the Wilson line at the point x, and that we are summing over repeated indices. 
This means that each T* < K|A > is a new Wilson line obtained from the original 
line < K|A > by leaving the form of the loop unchanged, but inserting the matrix 
T“ into that loop at the point x. A Lie algebra generator is diagrammed by a little 
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Fig. 36 Wilson loop 
insertion 
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box with a single index line and two input/output lines which correspond to its role 
as a matrix (hence as mappings of a vector space to itself). See Fig. 36. 


Remark. In thinking about the Wilson line < K|A >= tr(Pexp(f;, A)), it is helpful 
to recall Euler’s formula for the exponential: 


& = limyso(1 + x/n)". 


The Wilson line is the limit, over partitions of the loop K, of products of the 
matrices (1 + A(x)) where x runs over the partition. Thus we can write symbolically, 


<K|A>=]| [0 +AQ)) =| [0 +4k@)T“dx). 


x€K x€K 


It is understood that a product of matrices around a closed loop connotes the trace 
of the product. The ordering is forced by the one-dimensional nature of the loop. 
Insertion of a given matrix into this product at a point on the loop is then a well- 
defined concept. If T is a given matrix then it is understood that T < K|A > denotes 
the insertion of T into some point of the loop. In the case above, it is understood 
from context in the formula 


dx,dx,F!(x)T* < K|A > 


that the insertion is to be performed at the point x indicated in the argument of the 
curvature. 

Remark. The previous remark implies the following formula for the variation of the 
Wilson loop with respect to the gauge field: 


8 < K|A > /5(A‘(x)) = dT? < K|A>. 
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Varying the Wilson loop with respect to the gauge field results in the insertion of 
an infinitesimal Lie algebra element into the loop. 


Proof. 
§ < K|A > /6(A*(x)) 


= 8] [C+ ALQ)T*dyx)/5(AK@)) 


yeK 
= [] G+Af0)7*dy)[T%dxx] |] d+ ALO) T*dyx) 
y<x€K y>xeK 
=dx,T’ < K|A>. 


Fact 2. The variation of the Chern-Simons Lagrangian S with respect to the gauge 
potential at a given point in three-space is related to the values of the curvature 
tensor at that point by the following formula: 


Fp (x) = €rs15S/5(A,(%)). 


Here €gp- is the epsilon symbol for three indices, i.e. it is +1 for positive 
permutations of 123 and —1 for negative permutations of 123 and zero if any two 
indices are repeated. 

With these facts at hand we are prepared to determine how the Witten integral 
behaves under a small deformation of the loop K. 

In accord with the theme of this paper, we shall use a system of abstract tensor 
diagrams to look at the differential algebra related to the functional integral. The 
translation to diagrams is accomplished with the aid of Figs. 37 and 38. In Fig. 37 we 
give diagrammatic equivalents for the component parts of our machinery. Tensors 
become labelled boxes. Indices become lines emanating from the boxes. Repeated 
indices that we intend to sum over become lines from one box to another. (The 
eye can immediately apprehend the repeated indices and the tensors where they are 
repeated.) Note that we use a capital D with lines extending from the top and the 
bottom for the partial derivative with respect to the gauge field, a capital W with 
a link diagrammatic subscript for the Wilson loop, a cubic vertex for the three- 
index epsilon, little triangles with emanating arcs for the differentials of the space 
variables. 

The Lie algebra generators are little boxes with single index lines and two 
input/output lines which correspond to their roles as matrices (hence as mappings 
of a vector space to itself). The Lie algebra generators are, in all cases of our 
calculation, inserted into the Wilson line either through the curvature tensor or 
through insertions related to differentiating the Wilson line. 

In Fig. 38 we give the diagrammatic calculation of the change of the functional 
integral corresponding to a tiny change in the Wilson loop. The result is a double 
insertion of Lie Algebra generators into the line, coupled with the presence of a 
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Fig. 37 Notation i j 


5/5 AK(x) 


a curvature tensor 


Chern - Simons Lagrangian 


wl 


volume form that will vanish if the deformation does not twist in three independent 
directions. This shows that the functional integral is formally invariant under regular 
isotopy since the regular isotopy moves are changes in the Wilson line that happen 
entirely in a plane. One does not expect the integral to be invariant under a 
Reidemeister move of type one, and it is not. This framing compensation can be 
determined by the methods that we are discussing [32], but we will not go into the 
details of those calculations here. 

In Fig. 39 we show the application of the calculation in Fig. 38 to the case of 
switching a crossing. The same formula applies, with a different interpretation, to 
the case where x is a double point of transversal self-intersection of a loop K, and 
the deformation consists in shifting one of the crossing segments perpendicularly to 
the plane of intersection so that the self-intersection point disappears. In this case, 
one 7“ is inserted into each of the transversal crossing segments so that T¢T’ < 
K|A > denotes a Wilson loop with a self-intersection at x and insertions of T% 
at x + €; and x + €2 where €; and €2 denote small displacements along the two 
arcs of K that intersect at x. In this case, the volume form is nonzero, with two 
directions coming from the plane of movement of one arc, and the perpendicular 
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direction is the direction of the other arc. The reason for the insertion into the two 
lines is a direct consequence of the calculational form of Fig. 38: The first insertion 
is in the moving line, due to curvature. The second insertion is the consequence 
of differentiating the self-touching Wilson line. Since this line can be regarded as 
a product, the differentiation occurs twice at the point of intersection, and it is the 
second direction that produces the non-vanishing volume form. 

Up to the choice of our conventions for constants, the switching formula is, as 
shown in Fig. 39, 


Z(K4.) — Z(K_) = (4xi/k) / dAexp|(ik/4sr)S|T“T" < Kex|A > 
= (4ni/K)Z(T°T"Kex). 


The key point is to notice that the Lie algebra insertion for this difference is 
exactly what we did to make the weight systems for Vassiliev invariants (without 
the framing compensation). Thus the formalism of the Witten functional integral 
takes us directly to these weight systems in the case of the classical Lie algebras. 
The functional integral is central to the structure of the Vassiliev invariants. 
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5.3. Combinatorial Constructions for Vassiliev Invariants 


Perhaps the most remarkable thing about this story of the structure of the Vassiliev 
invariants is the way that Lie algebras are so naturally involved in the structure of the 
weight systems. This shows the remarkably close nature of the combinatorial struc- 
ture of Lie algebras and the combinatorics of knots and links via the Reidemeister 
moves. A really complete story about the Vassiliev invariants at this combinatorial 
level would produce their existence on the basis of the weight systems with entirely 
elementary arguments. 

As we have already mentioned, one can prove that a given set of weights for 
the top row, satisfying the abstract four-term relation does imply that there exists a 
Vassiliev invariant of finite type n realizing these weights for graphs with n nodes. 
Proofs of this result either use analysis [1, 4] or non-trivial algebra [4, 7]. There is 
no known elementary combinatorial proof of the existence of Vassiliev invariants 
for given top rows. 

Of course quantum link invariants (see Sect.4 of these lectures.) do give 
combinatorial constructions for large classes of link invariants. These constructions 
rest on solutions to the Yang-Baxter equations, and it is not known how to describe 
the subset of finite type Vassiliev invariants that are so produced. 

It is certainly helpful to look at the structure of Vassiliev invariants that arise from 
already-defined knot invariants. If V(K) is an already defined invariant of knots (and 
possibly links), then its extension to a Vassiliev invariant is calculated on embedded 
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graphs G by expanding each graphical vertex into a difference by resolving the 
vertex into a positive crossing and a negative crossing. If we know that V(K) is 
of finite type n and G has n nodes then we can take any embedding of G that is 
convenient, and calculate V(G) in terms of all the knots that arise in resolving the 
nodes of this chosen embedding. This is a finite collection of knots. Since there 
is a finite collection of 4-valent graphs with n nodes, it follows that the top row 
evaluation for the invariant V(K) is determined by the values of V(K) on a finite 
collection of knots. Instead of asking for the values of the Vassiliev invariant on 
a top row, we can ask for this set of knots and the values of the invariant on this 
set of knots. A minimal set of knots that can be used to generate a given Vassiliev 
invariant will be called a knots basis for the invariant. Thus we have shown that the 
set consisting of the unknot, the right-handed trefoil and the left handed-trefoil is a 
knots basis for a Vassiliev invariant of type 3. See [36] for more information about 
this point of view. 

A tantalizing combinatorial approach to Vassiliev invariants is due to Polyak and 
Viro [37]. They give explicit formulas for the second, third and fourth Vassiliev 
invariants and conjecture that their method will work for Vassiliev invariants of all 
orders. The method is as follows. 

First one makes a new representation for oriented knots by taking Gauss 
diagrams. A Gauss diagram is a diagrammatic representation of the classical Gauss 
code of the knot. The Gauss code is obtained from the oriented knot diagram by first 
labelling each crossing with a naming label (such as 1,2,...) and also indicating the 
crossing type (+1 or —1). Then choose a basepoint on the knot diagram and begin 
walking along the diagram, recording the name of the crossings encountered, their 
sign and whether the walk takes you over or under that crossing. For example, if 
you go under crossing | whose sign is + then you will record o + 1. Thus the Gauss 
code of the positive trefoil diagram is 


(o1+)(u2+)(03+)(ul+)(02+)(u3+). 


For prime knots the Gauss code is sufficient information to reconstruct the knot 
diagram. See [29] for a sketch of the proof of this result and for other references. 

To form a Gauss diagram from a Gauss code, take an oriented circle with a 
basepoint chosen on the circle. Walk along the circle marking it with the labels for 
the crossings in the order of the Gauss code. Now draw chords between the points 
on the circle that have the same label. Orient each chord from overcrossing site to 
undercrossing site. Mark each chord with +1 or —1 according to the sign of the 
corresponding crossing in the Gauss code. The resulting labelled and basepointed 
graph is the Gauss diagram for the knot. See Fig. 40 for examples. 

The Gauss diagram is deliberately formulated to have the structure of a chord 
diagram (as we have discussed for the weight systems for Vassiliev invariants). 
If G(K) is the Gauss diagram for a knot K, and D is an oriented (i.e. the chords 
as well as the circle in the diagram are oriented) chord diagram, let |G(K)| denote 
the number of chords in G(K) and |D| denote the number of chords in D. If 
|D| < |G(K)| then we may consider oriented embeddings of D in G(K). For a 
given embedding i : D —> G(K) define 
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Fig. 40 Gauss diagrams 


< i(D)|G(K) >= sign(i) 


where sign(i) denotes the product of the signs of the chords in G(K) MN i(D). Now 
suppose that C is a collection of oriented chord diagrams, each with n chords, 
and that 


eval: C—>R 


is an evaluation mapping on these diagrams that satisfies the four-term relation at 
level n. Then we can define 


<D|IK>= > <i(D)|G(K)> 
i:D—>G(K) 


and 


v(K) = )° < D|K > eval(D). 
DEC 
For appropriate oriented chord subsets this definition can produce Vassiliev 
invariants v(K) of type n. For example, in the case of the Vassiliev invariant of 
type three taking value 0 on the unknot and value 1 on the right-handed trefoil, —1 
on the left-handed trefoil, Polyak and Viro give the specific formula 


u3(K) =< A|K > +(1/2) < B|K > 


where A denotes the trefoil chord diagram as we described it in Sect.3 and B 
denotes the three-chord diagram consisting of two parallel chords pierced by a third 
chord. In Fig. 41 we show the specific orientations for the chord diagrams A and B. 
The key to this construction is in the choice of orientations for the chord diagrams in 
C = {A, B}. It is a nice exercise in translation of the Reidemeister moves to Gauss 
diagrams to see that v3(K) is indeed a knot invariant. 
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Fig. 41 Oriented chord 
diagrams for v3 HE 
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It is possible that all Vassiliev invariants can be constructed by a method similar 
to the formula v(K) = )onec < D|K > eval(D). This remains to be seen. 


5.4 Invertibility and the knot 847 


It is an open problem whether there are Vassiliev invariants that can detect the 
difference between a knot and its reverse (The reverse of an oriented knot is obtained 
by flipping the orientation.). The smallest instance of a non-invertible knot is the 
knot 8,7 depicted in Fig.42. Hale Trotter [46] was the first person to give proofs 
that some knots are non-invertible. We have not discussed his methods in this paper. 

Thus, at the time of this writing there is no known Vassiliev invariant that can 
detect the non-invertibility of 8,7. On the other hand, the tangle decomposition 
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shown in Fig.42 can be used in conjunction with the results of Siebenmann 
and Bonahon [42] and the formulations of John Conway [8] to show this non- 
invertibility. These tangle decomposition methods use higher level information 
about the diagrams than is easy to encode in Vassiliev invariants. The purpose of this 
section is to underline this discrepancy between different levels in the combinatorial 


topology. 
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How Can Cooperative Game Theory Be Made 
More Relevant to Economics? : An Open 
Problem 


Eric Maskin 


Abstract Game Theory pioneers J. von Neumann and O. Morgenstern gave most 
of their attention to the cooperative side of the subject. But cooperative game theory 
has had relatively little effect on economics. In this essay, I suggest why that might 
be and what is needed for cooperative theory to become more relevant to economics. 


Cooperative game theory is the part of game theory that pertains when players can 
sign binding contracts determining their actions and payoffs. J. von Neumann and 
O. Morgenstern devoted most of their seminal book [6] to cooperative theory, with 
subsequent major contributions by Nash [4] and Shapley [5]. 

But despite its auspicious beginnings, cooperative game theory has been used far 
less than noncooperative theory as a predictive tool in economics. Indeed, inspection 
of the current leading game theory textbooks used in graduate economics programs 
reveals that the ratio of cooperative to noncooperative theory is remarkably low (in 
one such text, [1], the ratio is 0). And all Nobel Memorial Prizes awarded for game 
theory to date have recognized work exclusively on the noncooperative side. 

This imbalance may seem strange. Cooperative theory seems to offer the 
important advantage of giving insight into how coalitions behave, i.e., how subsets 
of players bargain over which actions are played. Such bargaining seems basic 
to many aspects of economic and political life from the European Union, to the 
Paris climate change agreement, to the OPEC cartel. Moreover, on the face of 
it, cooperative theory appears to be far less dependent on particular details about 
strategies—and, therefore, more robust and general—than noncooperative theory. 

To understand the sense in which noncooperative is more detail dependent, 
let us briefly go over the basic noncooperative and cooperative models. In a 
noncooperative game, each player i,i = 1,...,n, chooses a strategy s; from a 
strategy set S;, and the payoffs of the game are given by the mapping 
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where gi(si,...,5,) is player i’s payoff if strategies (s1,...,5,) are played. The 
standard prediction for what will happen in game g is that players will choose Nash 
equilibrium strategies [3]. Strategies (s},..., 57) constitute a Nash equilibrium if 


BAS aes ss Sq) = GC seo Spgs Sis Spe jssea8,,)> forall cand all's; eS; (1) 


As formula (1) makes clear, Nash equilibrium depends crucially on what strategies 
are or are not in each player’s strategy set; for example, adding a single strategy s’ to 
S; can destroy (s},..., 57) as an equilibrium and change the predicted outcome of 
the game discontinuously, even if s; generates payoffs quite similar to those of s*. 

By contrast, cooperative games are typically described by a characteristic 
function v. Given a coalition of players S C {1,...,”}, v(S) is the sum of 
payoffs that the members of S can get on their own. An often-used predictive 
concept in cooperative game theory is the Shapley value (unlike Nash equilibrium 
in noncooperative theory, the Shapley value has serious competition as a predictive 
concept; there are some other leading notions, such as the core and the bargaining 
set). Given characteristic function v, player i’s Shapley value payoff is 


S|!(a— |S 
> Bhat wisu ta) — 1), 


ie., player i gets his expected marginal contribution to coalitions, where the 
expectation is taken over all possible coalitions that he might join. 

Notice that in the cooperative-game setting, players’ strategies no longer are 
modeled explicitly—only the resulting payoffs matter. Thus, many different non- 
cooperative games can be associated with the same characteristic function. In that 
sense, the characteristic function approach is more general than the noncooperative 
model. Moreover, in cooperative games, the discontinuities that arise in noncoop- 
erative games no longer occur: the characteristic function and Shapley value vary 
continuously with the payoff possibilities. In that sense, cooperative games are more 
robust than noncooperative games. 

So why, despite these advantages, is cooperative game theory currently domi- 
nated by noncooperative theory as applied to economics? Perhaps one answer is 
that the characteristic function, by assumption, rules out externalities—situations 
in which a coalition’s payoff depends on what other coalitions are doing. Yet, 
interactions between coalitions are at the very heart of economics, e.g., bargaining 
between unions and management, competition between companies, and trade 
between nations. Moreover, even in the (relatively small) cooperative literature 
that does accommodate externalities (the partition-function approach; see [2]), 
extensions of the Shapley value and of other leading cooperative concepts do 
not predict competition between coalitions; instead, they assume as a matter of 
definition that the grand coalition—the coalition of all players—always forms. Of 
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course this flies in the face of reality, where, in most settings, we don’t typically see 
just a single big coalition, but rather several smaller coalitions. Furthermore, there is 
a good theoretical reason why, in a model with externalities, we should not expect 
the grand coalition to form. 

To illustrate this point, let us consider the following three-player game, in which 
coalitions can produce public goods. The coalition of players 1 and 2—{1, 2}—-can 
produce a total payoff of 12 for itself, {1, 3} can produce 13, and {2, 3} can produce 
14. The grand coalition {1,2,3} can produce 24. A player can produce nothing on 
his own. However, if the other two players form a coalition, he can free-ride on the 
public good they produce and enjoy a payoff of 9 (which is the externality that the 
coalition confers on him). 

I claim that, we should not expect the grand coalition to form in this game. To 
see why not, imagine that all bargaining is conducted at a particular site and player 
1 arrives there first, followed by 2, and finally by 3. When player 2 arrives, player 1 
can make him offer to join | in a coalition. Let us explore what 2 must be offered to 
be willing to join. Notice that if he does not join with 1, he will be in competition 
with | for signing up 3. In this competition, | will be willing to bid 13 (the gross 
value of the coalition with 3) minus 9 (which he would get as a free-rider if 3 signed 
up with 2), i.e., 4. Similarly, 2 will be willing to bid 14—9 = 5. Hence, 2 will win the 
bidding war for 3 and will pay 4 (notice that because, in this thought experiment, 
1 and 2 don’t form a coalition, 3 has no possibility of free-riding and so will be 
willing to accept 4). Hence 2’s payoff if he refuses to join with 1 is 14 — 4 = 10. 
Thus, player | must offer him 10 in order to sign him up. 

Assuming 2 is signed up, 1 must then offer 3 a payoff of 9 to attract him to 
coalition {1,2} (because 3 has the option to free-ride on {1,2} and get 9 that way). 
Hence, altogether player 1 must pay 10 + 9 = 19 in order to form the grand 
coalition. But this leaves only 24 — 19 = 5 for himself. Clearly, he would be better 
off refraining from signing up 2—in which case, as analyzed above, 2 will form a 
coalition with 3. And | obtains a free-riding payoff of 9. 

I conclude that with arrival order 1, 2, 3, two separate coalitions will form: {2, 3} 
and {1}. A similar conclusion follows for the five other possible arrival orders. 

Unfortunately, cooperative game theory in its current state does not allow for 
such a two-coalition outcome. In my view, it remains an open problem—perhaps 
the most important open problem in cooperative theory—to develop an approach 
that properly accommodates the formation of multiple coalitions. Only by solving 
this problem can we make cooperative game theory relevant to economics. 
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The Erdos-Szekeres Problem 


Walter Morris and Valeriu Soltan 


Abstract Erdés and Szekeres proved in their 1935 paper that for every integer n > 
3 there exists a smallest positive integer N(n) such that any set of at least V(7) points 
in general position in the plane contains points which are the vertices of a convex 
n-gon. They also posed the problem to determine the value of N(n) and conjectured 
that N(n) = 2”? + 1 for all n > 3. Despite the efforts of many mathematicians, the 
Erdés-Szekeres problem is still far from being solved. This chapter describes recent 
achievements towards the solution of this problem and some of its close relatives. 


1 Introduction 


In 1932, the young Hungarian mathematician Esther Klein observed that any set of 
five points in general position in the plane (that is, no three of the points belong to a 
line) contains the vertices of a convex quadrilateral. 

Indeed, there are three distinct types of placement of five points in general 
position in the plane, as shown in Fig. 1. In any of these cases, we can pick out 
at least one convex quadrilateral determined by the points. Furthermore, the chosen 
quadrilateral may contain no other point of the set in its interior (see Sect.3 for a 
related problem). 

Esther Klein realized that this elementary fact could be a particular case of 
the following interesting problem, which she posed to her friends Paul Erdés and 
George Szekeres: given an integer n > 3, does there exist a positive integer N(n) 
such that from any set containing at least N(n) points in general position in the 
plane, it is possible to choose n points which form a convex n-gon? 

In their enormously influential paper [16] from 1935, Erdés and Szekeres proved 
the existence of the number N(n) by two different methods. The first method 
uses Ramsey’s theorem (independently discovered by Szekeres for the purpose of 
solving Klein’s problem) and gives the estimate N(n) < R4(5,n), where R4(5, 7) 
is a Ramsey number (see Sect.2.4 for details). The second method, based on 
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Fig. 1 Any five points in general position in the plane determine a convex quadrilateral 


consideration of convex and concave sequences of points, produces a better upper 
bound, V(n) < Cs) +1. In the same paper, Erdés and Szekeres posed their famous 
conjecture, often formulated as a problem. 


Problem 1. Is it true that N(n) = 2"-? + 1 for all n > 3? 


The conjecture that N(n) = 2”? + 1 for all n > 3 is attributed to Klein by 
Szekeres and Peters in [43]. In 1961, Erdés and Szekeres [17] returned to their 
problem; they gave an example of 2”~? points in general position in the plane 
containing no vertex set of a convex n-gon. In other words, they established the 
inequality N(n) > 2”-? +1. 

Their work on Problem | preceded and perhaps led to the marriage of Klein 
and Szekeres, prompting Erdés to call Problem | the “Happy End Problem.” Their 
passing (see [13]) was shortly before the Szekeres-Peters paper [43] appeared and 
established that V(6) = 17. Thus Problem | framed their marriage of nearly seventy 
years. Additional historic and bibliographic comments related to this topic can be 
found in [36]. 

Problem 1 is steadily gaining interest, generating new original questions and 
directions of research. An essential part of the existing results were summarized 
around 2000 in the surveys of Barany and Karolyi [2] and Morris and Soltan [36]. 
This topic also is considered in separate chapters of the books of MatouSek [35] 
and Brass et al. [8]. At present we witness a further development of the field, with 
research to find the broadest generalizations and the deepest essence of Erdés and 
Szekeres’ original results. 

This chapter is based on the authors’ survey [36]. It gives an updated account of 
results immediately related to the Erdés-Szekeres problem, leaving more distant 
topics for further analysis. The content of the chapter is indicated by section 
headings as follows. 


. Introduction. 

. The Erdés-Szekeres problem on convex polygons. 
. The Erdés problem on empty convex polygons. 

. Higher dimensional extensions. 


WN Fe 


Some words about notation: |X| stands for the cardinality of a finite set X; if X is a 
subset of the Euclidean space E, then aff X and conv X mean, respectively, the affine 
and convex hull of X. A set X C E? is said to be convexly independent if no point 
of X lies in the convex hull of the remaining points. Furthermore, (p,q) denotes the 
line through distinct points p and gq, and [p, qg) stands for the closed halfline through 
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q with endpoint p. Given functions f(x) and g(x) on the halfline [0, 00), we write 
f = O(g) and g = Q(f) provided there are positive constants C and k such that 
If(x)| < Clg(x)| for all x > k. Finally, [m] means the set {1,2,...,m}, and (im) 
denotes the family of all subsets of [m] of cardinality k. 


2 The Erdés-Szekeres Problem on Convex Polygons 


This section describes the advances in solving the Erdés-Szekeres problem in the 
plane. We start with the “caps and cups” technique from [16], which gives upper 
bounds on the number NV(n), that are O(4"). 


2.1 Caps and Cups 


Let X = {pj,P2,---,Pm} be a set of points in general position in the coordinate 
plane. If necessary, we can rotate X by a small angle to ensure that no two points of 
X have the same x-coordinate. Renumbering these points, we also may assume that 
p; has a greater x-coordinate than p; whenever i < j. For a subset J = {i),ip,..., in} 
of the set [m] = {1,2,...,m}, we say that the set {p; : i € J} is an n-cap if pj, , 
lies above the line through p;, and p;.,, for all 7 = 1,2,...,n — 2. Similarly, the 
set {p; : i € J} is an n-cup if p;,,, lies below the line through p;, and p;,,, for all 
j=1,2,...,n—2 (see examples in Fig. 2). We will call p;, the left endpoint and p;, 
the right endpoint of the n-cap (or an n-cup) {pj,, Pi,,-- +. DPi,}- 

Given integers k, 2 > 3, define f(k, £) as the smallest positive integer for which 
a set X in general position in the plane contains a k-cup or an £-cap whenever X has 
at least f(k, £) points. The following lemma gives a key argument in known proofs 
for upper bounds on N(n). 


Lemma 1. Suppose that a set X in general position in the plane contains at least 
f(k-1L, O+f(k, €—1)—1 points, where k, £ > 3. If the left endpoint p;, of a (k—1)- 
CUP {Pi,,-+ ++ Pi,_,} is the right endpoint p;,_, of an (€ — 1)-cap {pj,. Pix. +--+ Pje—}s 
then X contains a k-cup {Pjp_,Pi,sPins--+>Piz_} OF an €-cap {Pj,. jn.» ~ + Pip 
Pi,}, depending on whether pj, is below or above the line through pj,_, and Pijy. 
The same conclusion is reached if the left endpoint of an (£ — 1)-cap is the right 
endpoint of a (k — 1)-cup (Fig. 3). Oo 


Fig. 2. A cup and a cap /o\ 
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Fig. 3. A cap followed by a 
cup 


Theorem 1. f(k,£) < (“15*) + 1 forall k, £ > 3. 


Proof. The inequality follows from the boundary conditions 


f%,3) =/3,0 =k= (‘-,) ng 


and the recurrence 
f(k, 8) <fk-1,0 +f, €-1)-1. 


Here is a proof of the recurrence. Choose a set X in general position in the plane 
with at least f(k—1, 2) +f (k, €—1)—1 points. Let Y be the set of right endpoints of 
(€—1)-caps of X. If X\ Y contains f(k, £—1) points, then it contains a k-cup, because 
X \ Y contains no (€ — 1)-cap. Otherwise Y contains f(k — 1, £) points. Suppose that 
Y contains a (k — 1)-cup {pi,, Pi. +++» Pip} Let {Pj,, Pins +++ Pje_,} be an (€ — 1)- 
cap with jp_; = i). By Lemma 1, X contains a k-cup {pj,_,, Pi, Pins +++» Piz) } OF AN 
€-cap {Pj ,Pjxs-- + +Pje—1+ Pin} Finally, 


fk-1O+fk,£-D)-1< (eee rea eer cree 


_ fe+t—4 “4 
~\ k-2 o 
The inequality N(n) < f(n,n) immediately implies the following upper bound 
on N(n), obtained by Erdés and Szekeres [16] in 1935. 


Corollary 1 ({16]). N(n) < ?"5') + 1 for all n > 3. o 


In this regard we observe that the argument, which show the existence of a 
convex n-gon, actually proves the stronger statement that either an n-cap or an 
n-cup is contained in the set. It is obvious, but perhaps worth mentioning, that if 
|X| > f(n,n), then every set obtained by rotating X will contain an n-cup or an 
n-cap; thus the set X contains several sets that are rotations of n-cups or n-caps. 

The idea for the lower bound construction comes from Erdés and Szekeres’ 1961 
paper [17]. A correction to an error in that paper was made by Kalbfleisch and 
Stanton [27]. The following proof is taken from Lovasz [34, Sect. 14]. 
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Theorem 2. f(k, 0) = (“F'5*) + 1 for all k, € > 3. 


Proof. By Theorem 1, it suffices to prove the inequality f(k, £2) > ours +1. 

We argue by double induction on k and £. As previously observed, this inequality 
is true fork = 3 and £ = 3. Suppose that we have a set A of ean 7 ) points in general 
position, with no (k — 1)-cup and no ¢-cap, and a set B of eae ‘) points in general 
position, with no k-cup and no (€ — 1)-cap. Translate these sets so that every point 
of B has greater x-coordinate than the x-coordinates of points of A, and the slope of 
any line connecting a point of A to a point of B is greater than the slope of any line 
connecting two points of B. Let X = A U B be the resulting set. Any cup in X that 
contains elements of both A and B may have only one element of B. Thus X contains 
no k-cup. We similarly see that X contains no €-cap. Consequently, 


k+¢l-4 
Fit) = + 1a)+1 = ( a Jes a 


The next theorem gives the lower bound on N(n) obtained by Erd6és and 
Szekeres [17]. 


Theorem 3 ({17]). N(n) > 2”? + 1 for alln > 3. 


Proof. For i = 0,1,...,n — 2, let T; be a set of (",”) points in general position 
in the plane, containing no (i + 2)-cap and no (n — 7)-cup and having the property 
that no two points in the set are connected by a line having slope of absolute value 
greater than |. (The existence of 7; follows from Theorem 2, and the condition on 
slopes can be satisfied by “compressing” 7; vertically.) Place a small eony of T; ina 
neighborhood of the point on the unit circle making an angle of | — 5 ay 5 With the 
positive x-axis. Let X be the union of 7), 7>,..., T,—2. Clearly, x] = — ime) 

Suppose that Y is a convexly independent subset of X. Let k and £ be the smallest 
and the largest values of iso that YT; 4 0. If k = @, then Y contains no (k+2)-cap 
and no (n — k)-cup. The construction guarantees that: 


1. YO Tis acap of at most k + 1 points, 
2. YN T; is acup of at most n — £ — 1 points, 
3. |YONT;| < 1 forali=k+1,k+2,...,€-1. 


Summing up, |Y| < (K+ 1)+ @-k-1)4+ (-€-1)=n-1. Oo 


A 16-point set with no convex hexagon and with an appealing circular symmetry 
is shown in the book of Bokowski [6, p. 283]. It would be interesting to see this 
example generalized to larger sets with 2”~* points and no convex n-gon. 
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2.2 Improvements of the Upper Bounds on N(n) 


The first improvement of the inequality N(n) < oe 3) + 1 was made in 1998 by 
Chung and Graham [11], who reduced the number = ‘) + 1 by one. This was 
followed shortly afterward by Kleitman and Pachter’s (30] improvement to N(n) < 
=) —2n+ 7. A key new idea of a projectively transformed set led Téth and 
Valtr [44] to lower the bound further to N(n) < ) + 2. In 2005, the same 
authors proved the following result. 


Theorem 4 ([45]). N(n) < 73) + 1 forall n> 5. 


Proof. Let X be a set of ‘ee ) + 1 points in general position in the plane. Choose 
an extreme point p of conv X, and let g be a point outside conv X such that no line 
determined by a pair of points in X \ {p} meets the segment [p, g]. Also, choose a 
line L through q disjoint from conv X. 

Let T be a projective transformation that maps L to the line at infinity and also 
maps the segment [p,q] to the vertical halfline emanating downward from T(p). 
Clearly, the set T(X) has the following properties: 


1. a subset Y of X is convexly independent if and only if T(Y) is, 
2. a subset Z of X containing p is convexly independent if and only if T(Z \ {p}) is 
a cap. 


For the rest of the proof, assume that T(X \ {p}) does not contain any (n — 1)-cap 
or n-cup. Let Y be the set of points in T(X \ {p}) that are right endpoints of (n — 2)- 
caps. If |Y| > ( | then Y contains an (n — 1)-cap or an (n — 1)-cup. Since the 
first case contradicts the assumption, Y contains an (n — 1)-cup. The left endpoint 
of this cup is the right endpoint of an (n — 2)-cap. By Lemma 1, T(X \ {p}) would 
have an (n — 1)-cap or an n-cup, and we are assuming that neither of these may 
occur. If |7(X \ {p}) \ ¥| > C9), then T(X \ {p}) \ ¥ contains an (n — 2)-cap or an 
n-cup. The first of these cannot happen because Y contains all the right endpoints of 
(n — 2)-caps. 

Summing up, we have established that 


ae es) and |T(X\ {p})\ ¥| = (7 


If t € T(X \ {p}) \ Y, then Y U {¢} contains ee ) + 1 points; so it contains an 
(n — 1)-cup. If the left endpoint of this cup is in Y, then we have an (n — 1)-cap or 
n-cup in T(X \ {p}) by Lemma 1. Therefore, every element of T(X \ {p}) \ Y is the 
left endpoint of an (n — 1)-cup with right endpoint in Y. This implies, by the same 
line of reasoning, that every element of Y is the right endpoint of an (n—2)-cap with 
left endpoint in T(X \ {p}) \ Y. 

Let S be the set of all segments [v, v’], where v € T(X \ {p}) \ Yand v’ € Y, 
and there is an (n — 2)-cap or (n — 1)-cup with left endpoint v and right endpoint v’. 
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Assume that a segment [v, v’] from S has the largest slope. Suppose [v, v’] represents 
an (n — 1)-cup with elements v = v1, V2,...,Un—) = v’ listed in order from left 
to right. There is an (n — 2)-cap with right endpoint v’ and left endpoint in T(X \ 
{p}) \ Y. Let uy, uz, ..., Un—2 be its points listed from left to right. If u,—3 lies above 
the line through v1, v2, then {u,—3, V1, V2,..., U,—1} is a convexly independent set. 
Otherwise, {u1,U2,...,Un—3, U1, U2} is an (n — 1)-cap in T(X \ {p}). The argument 
for the case where [v, v’] represents an (n — 2)-cap is similar to the previous case. 

oO 


Strunk [42] proves some improved bounds for point sets satisfying some 
conditions. For example, he proves that for n > 4, any set of () —2n-+ 10 points 
in general position in the plane with an (n — 1)-gon as its convex hull contains a 


convex n-gon. 


2.3. The Values of N(n) for Small n 


Esther Klein’s observation (see Sect. 1) shows that N(4) = 5. The next value, 
N(5) = 9, was first published in 1970 by Kalbfleisch et al. [26]. The proof below is 
due to Bonnice [7]. 

We will say that a set of nine points X in general position in the plane is of type 
(k,, ko, k3) if the set X, of vertices of conv X has k; elements, the set Xz of vertices 
of conv (X \ Xj) has kp elements, and |X \ (X; U X2)| = ks (see Fig. 4). 

Suppose a set of points Q = {x, y, z, w} is convexly independent and the points 
are encountered in the order x, y, z, w as one traverses the convex hull of Q. The 
convex polygonal region bounded by the halflines [y, x), [z, w) and the segment [y, z| 
is called the beam yz : xw (see Fig.5). Note that the beam yz : xw is bounded 
if the halflines [y, x) and [z, w) meet. Similarly, for non-collinear points x, y, w, the 
convex cone bounded by the halflines [y, x) and [y, w) is called a beam and is denoted 
yixw. 

Suppose that X is of type (3, 3,2), and let X; = {z1,z.}. There are two points 
of X. = {v}, V2, v3}, say Vv; and v2, on the same side of the line containing X3. We 


Fig. 4 A set of type (4,3,2) 


Fig. 5 A beam yz: xw w 
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can assume that the halfline [z), z2) meets the segment [v1, v3]. If there is a point of 
X, = {y}, 2, y3} in the beam z;Z> : v; V2, then this point, together with z,, 22, v1, V2, 
forms a convex pentagon. Otherwise, we can assume that there are two points of X1, 
say y; and yo, that are both in one of the beams z, : v2v3 and Zz) : Vv; v3. In either 
case, y, and y>, together with the three points defining the beam, form a convexly 
independent set of five points. 

Suppose that X is of type (4,3, 1). Let X3; = {z}. Then one of the three beams 
formed by z and two points of X must contain two points of X,. As in the previous 
case, these two points, together with the three points defining the beam, form a 
convex pentagon. 

Finally, suppose that X is of type (3, 4, 2). Put X3 = {z,, Z)}, and let the points of 
X2 be Vj, V2, V3, U4, appearing in the order v,, V2, V3, V4 as One traverses the boundary 
of conv X>. If the line through z; and z, has one point of X2, say v;, on one side and 
the other three points of X2 on the other side, then the points z;, Z2, v2, v3, v4 form 
a convex pentagon. Thus we may assume that v; and v2 are on one side of the line 
(z1, 22), while v3 and v4 are on the other side. We let the halfline [z,, z.) meet the 
segment [v, v3]. If the beam z;z2 : vj v2 or the beam z,z : v3v4 contains a point 
of X, then this point, together with the four defining the beam, forms a convex 
pentagon. Otherwise, there are two points of X; in one of the beams z; : v; v4 and 
Z2 : U2U3. In both cases, the two points of X; together with the three points defining 
the beam yield a convex pentagon. 


Theorem 5. N(5) = 9. 


Proof. Let X be a set of 9 points in general position in the plane. If conv X has less 
than 5 points, it must be of one of the types 


(4,4,1), (4,3,2), (3,4,2), (3,3, 3). 


Clearly, each set of the first two types contains a subset of type (4, 3, 1). The third 
type, (3, 4, 2), has been considered above, and a set of type (3, 3, 3) contains a subset 
of type (3, 3, 2). In every case, X contains a subset of 5 convexly independent points. 

Oo 


The main virtue of this proof is its brevity. A similar approach was used by 
Dehnhardt et al. [14] to show, in seventeen pages, that if X is a set of 17 points in 
general position in the plane and conv X has 5 vertices, then X contains 6 convexly 
independent points. The rapid increase in the number of cases seems to limit the 
method of classifying point sets by the sequence of sizes of the nested convex 
hulls. This method, however, was surprisingly effective in the solution of the empty 
hexagon problem discussed in Sect. 3. 

The next value of N(n), namely N(6) = 17, follows from a much stronger result 
of Szekeres and Peters [43], described below. 

Suppose that X = {p1,p2,...,Pm} is a set in general position in the plane such 
that the x-coordinate of p; is less than the x-coordinate of p; whenever i < j. Define 
a function oy : (im) —> {-1, 1} by ox({i,j,k}) = 1Lifi <j < k and p; is above 
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the line through p; and p;, and ox({i,j,k}) = —1 otherwise. A function o : (fl) > 
{-1, 1} will be called realizable if there is a planar set X in general position such 
that o = ox. 

The geometric meaning of this approach can be explained as follows: if a 


function o : (fl) — {-1, 1} is realizable, then for a set 


T= {i,ib,...,in} C [m] = {1,2,...,m} 
and sets {i,,i,} C A C Jand B = (1 \ A) U {i), i,}, expressed as 
A= {iy =a, <a. <+++< ay = in}, B= {iy = b; < bo <tee < Dy—-K42 = ints 


the equality 


k—2 n—k 


> 0 (ai, Gi41, 4i42}) — > 0 ({bi, bi41, bi42}) =n —2 e(I, A) 


i=1 j=l 


implies the existence of a convex n-gon made up of an |A|-cap indexed by elements 
of A and a |B|-cup indexed by elements of B. 

Denote by N the smallest integer m such that there is no function o : (i) > 
{—1, 1} satisfying the following condition: the inequality 


k—2 n—k 


oO ({i, Gi41, 4i42}) — a 0 ({bi, bit1, bi42}) <n—2 iI, A) 


i=l j=l 
holds for all J = {i,,i2,...,i,} C [m] and {i,,i,} C A C I. We will denote by i(/) 
the conjunction of all inequalities i(/, A) for a given set J. 

Theorem 6 ([43]). N(4) = 5. 


Proof. This tiny argument from [43] can serve as a model for later computer 
searches. Suppose that a function o : (2) — {-1, 1} satisfies the inequality i(/, A) 
for all 


T= {i,, ib, 13, i4} C [5] and {ij,i4} CA CT. 


Assume, without loss of generality, that o({1,2,3}) = 1. From the obvious 
inequality i({1, 2,3, 5}, {1,2, 3, 5}) it follows that 0 ({2, 3,5} = —1. Then 

i({2, 3,4, 5}, {2,4, 53) > o({2, 4, 5}) = -1, 

i({1, 2,4, 5}, {1, 5}) > o({1, 2, 4}) = 1, 

i({1, 2,3, 4}, {1,2, 4}) => o({1, 3, 4}) = 1, 
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i({1, 3, 4, 5}, {1, 3, 4, 5}) => o({3, 4, 5}) = -1, 
i({2, 3,4, 5}, {2, 5}) > o({2,3, 43) = 1. 


Now i({1, 2,3, 4}, {1, 2, 3, 4}) implies o({1, 2, 3}) = —1, contradicting our original 
assumption. Oo 


By a computer search, Szekeres and Peters established the following statement. 
Theorem 7 ([43]). N(5) = 9. 


It should be emphasized that Theorem 7 is much stronger than Theorem 5, 
because it holds for the class of all functions o : (2) — {-1,1}, which is much 
larger than the class of realizable functions ox : (Pl) > {-1,]}. 

Szekeres and Peters raise the fascinating possibility that N(n) = 2-2 +1 might 
hold for all n. This would not contradict the lower bound on the Ramsey number 
R3(n,n), as discussed in Sect.2.4. Indeed, the equality e(/,A) for some A C I 
determines the values of o({i,j,k}) for all {i,j,k} C I when the function o is 
realizable, while this is not the case for non-realizable functions o. 

Szekeres and Peters were not able to determine whether N (6) = 17, because they 
could not utilize the constraints i(/,A) efficiently enough to force a contradiction 
within the time they had allotted for the computation. This led them to the 
introduction of the 4-realizability constraints, which hold for realizable functions o. 

For a given : (ea) — {-1, 1} and a 4-element set J = {i, < in < i3 < ig} C 
[m], a realizable o must satisfy either 


O({i1, io, i3}) = oft, i2,i4}) and o({ij, i3, i4}) = oO (id, is, i4}) 
if the points indexed by 1, iz, i3, 14 are convexly independent, or 


O (ti, i2, i3}) = —o (fin, 3, i4}) and o({i, in, i4}) = oi, is, i4}) 


if the point indexed by i, or i3 is in the convex hull of the other three. The function 
satisfies one or the other of these if and only if the following system i,(J) of 
inequalities holds: 


— 3 < o(fit, i, igh) — Oi, is, 4h) + 0 (lin, i3, igh) < 3, 


5 tied ee de is) 
—3 < o({ij, iz, i3}) — oO ({i1, to, i4}) + O (fb, is, i4}) < 3. 
If o satisfies the condition i4(J) for all J C [m], we say that o is 4-realizable. 

We define N*(n) as the smallest integer m such that there is no 4-realizable 
function o : (G) — {-1, 1} so that all inequalities i(7, A) hold. These additional 
restrictions were enough to allow Szekeres and Peters to establish the following 
result. 


Theorem 8. N*(6) = 17. 
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Theorem 8 is equivalent to the statement that the generalization of the conjecture 
N(6) = 17 to rank 3 uniform oriented matroids, put forward by Goodman and 
Pollack [21], is true. 

We will say a few words about the strategy for the computer proof. A function 
a: (im) — {-1,1,*} will be called a partial function. We say that a partial 
function o is consistent if for each pair (J, A) the inequality i(7,A) holds whenever 
the restriction of o to the variables involved in the inequality has range contained 
in {—1, 1}. A partial function o’ agrees with o if o’(t) = o(t) whenever o(t) € 
{—1, 1}. We say that a partial function o is feasible if there is a consistent function 
o’ with range {—1, 1} that agrees with o. The constant function oo that sends every 
triple to * is consistent, and the goal is to show that it is not feasible. 

In the proof of Theorem 6 above, we started with the constant partial function 
0o, and picked a function o’ that agrees with it, by changing oo({1, 2, 3}) to 1. The 
subsequent derivation established that o’ was infeasible. If we went on to show that 
the function o” obtained from oo by changing oo({1, 2, 3}) to —1 is infeasible, then 
we would have established that oo is infeasible. This last step is not necessary to 
verify, by symmetry. 

Szekeres and Peters determined that there are only 892 possible restrictions of 
a function o : (el) — {-1, 1} to the set of triples in a 6-element subset J of [m] 
that are 4-realizable and satisfy the system i(/). (Compare 892 to the cardinality 
2() of the family of all o : () — {-1,1}). Their computer search alternates 
between finding, for a given consistent partial function o, a partial function o’ that 
agrees with both o and the restriction of a 4-realizable function satisfying i(/) to 
a contiguous subset J = {i),7, + 1,...,é, + 5} of [17], and determining if the 
o’ obtained is consistent. Three independent implementations of this strategy, each 
differing slightly in the details, were made, one each by Szekeres, Peters, and by 
Brendan McKay. Each arrived at the conclusion that there is no feasible function 
oO: (U7) — {-1, 1}. Subsequent work by Koshelev [32] (see Sect. 3) independently 
verified the result. 

The work of Szekeres and Peters leaves us with the challenges: 


1. Find a noncomputer proof that N(5) = 9 holds, 
2. Find a proof (computer or otherwise) that N(6) = 17 holds. 


Lemma | can be applied directly to show that N(n) < (22) + 1, as was pointed 


out in [18]. It is not obvious that the improved arguments from [44] can be adapted 
to show that N(n) is bounded by any smaller function of n. 


2.4 Ramsey Theory 


The first proof of the existence of N(m) involved rediscovering Ramsey’s theorem. 
The bounds given by Ramsey’s Theorem tend to be much larger than the ones given 
in previous sections. 
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Theorem 9 ((39]). For any positive integers k,, £,, £2,...,£,, there exists a small- 
est positive integer mo Satisfying the following condition. For any integer m = mo, if 
the k-element subsets of |m] are colored with colors 1,2,...,1, then there exists an 


i, 1 <i<r, and an £;-element subset T C [m] so that each of the k-element subsets 
of T is i-colored. 


The smallest number mo for which the conclusion of Ramsey’s theorem holds is 
usually denoted by R, (£1, £2,...,£,) or, if all of the £; are equal to £, Ry (7; £). 


Theorem 10. For any positive integer n > 3, the number N(n) exists and 
N(n) < min{R4(n, 5), R3(n,n)}. 


Proof. Suppose X is a set of R4(n, 5) points in general position in the plane. Color a 
subset of four points from X with color | if the four points are convexly independent, 
and with color 2 otherwise. Because N(4) = 5, there is no set of 5 points with 
all 4-element subsets colored with color 2. Therefore, there is a set of n points 
of X with every 4-element subset convexly independent. It is easy to see (using a 
planar version of Carathéodory’s theorem [10]) that this n-element set must itself be 
convexly independent. Summing up, N() < Ra(n, 5). 

For the second inequality, let X¥ = {pi,p2,...,Pm} be a set of points in general 
position in the plane, with m > R3(n,n). Color a 3-element subset {p;, pj, px} C X, 
i < j < k, with color | if one encounters the points in the order (p;, pj, px) by 
passing clockwise around their convex hull. Color the subset with color 2 otherwise. 
An n-element subset of X is convexly independent if all of its 3-element subsets are 
colored with the same color. Hence N(n) < R3(n, n). Oo 


The first statement of Theorem 10 gives Szekeres’ original proof of the existence 
of N(n) for all n > 3, while the second statement is due to Tarsy (see [33]). It seems 
that Tarsy’s proof is most relevant for the evolution of the theory. Another way to 
see that there is no set of five points with all 4-element subsets colored with color 
2 in Szekeres’ proof is that the no two of the six line segments formed by pairs of 
vertices in a 4-element set of color 2 cross. A set of five points with all 4-element 
subsets of color 2 would therefore yield a planar drawing of the complete graph Ks. 
This proof that V(4) = 5 was shown to us by H. Tverberg (private communication). 

The particular set of n convexly independent points guaranteed by the monochro- 
matic set of triples depends on the ordering of the elements of X. If the points of 
X are listed in order of increasing x coordinate, then the 3-element subsets of an 
n-element subset Y of X are all of color 1 (color 2) if and only if the points of Y 
form an n-cap (n-cup). Thus |X| > R3(n,n) implies the existence of an n-cup or an 
n-cap, but also many other convexly independent sets. 

The best known bounds on R3(n, n) are bn? < R3(n,n) < 2° for some constants 
b and c (see [12]). 

One reason why the number R3(n,n) is so much larger than N(n) is the 
requirement of a monochromatic clique in the hypergraph. For a realizable set, there 
are sparse monochromatic subhypergraphs which, together with the 4-realizability 
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conditions, guarantee the existence of a monochromatic clique in the 3-regular 
hypergraph. Szekeres and Peters [43] looked for a solution to an equality e(/, A), as 
shown in the previous section. The special cases A = J and A = {ij, i,} correspond 
to minimal evidence for existence of an n-cup and an n-cap. The other values of A 
involve triples for which o is both 1 and —1, so they do not fit so neatly into the 
Ramsey theory framework in which monochromatic subsets of triples are sought. 

Fox et al. [18] pursued this direction further, going beyond the planar case and 
allowing more than 2 colors. For a sequence of positive integers j) < j2 <++: < jn, 
the k-tuples 


Giditis+- dite), §=1,2,...,.n-k4+1, 


are said to form a monotone path of length n. Note that the case k = 3 gives the 
collection of triples appearing in e(/,/) and e(/, {j1,jn}) for 7 = {f1,j2,...,Jn}. Let 
Ni(q,n) be the smallest integer NV such that for every coloring of the edges of the 
complete k-regular hypergraph on N vertices there is a monochromatic monotone 
path of length n. It is proved in [18] that 


yal < N3(q,n) < grt! logn | 


One should also mention that paths in 3-regular hypergraphs, without the 
monotonicity requirement, have also been studied. The paper of Haxell et al. [23] 
shows that only a linear number in n of points is needed to guarantee the existence 
of a monochromatic sequence ({j;, ji+1,ji+2}), i = 1,2,...,n — 2. However, the 
absence of the condition j, < jz < --- <j, means that this type of sequence is not 
sufficient to guarantee a convex n-gon in the realizable case. 

Johnson’s [25] Ramsey-theoretic proof colors a 3-element subset S of a planar 
point set X in general position with color | if there is an even number of points of 
X in the interior of conv S, and with color 2 otherwise. Again it is easy to show that 
a complete monochromatic hypergraph with n vertices corresponds to a convexly 
independent n-element subset of X. 

The paper of Alon et al. [1] considered colorings of ordered triples rather than 
unordered triples. It defines a happy-end space to be a set S along with a function 
f:SxSxS — {1,—-1} such that f(x, y, z) = f(y,x, z) for all x, y, z. A subset C 
of S is called convex if and only if for every subset B of C such that |B| > 2 and 
every point x of B there is a y # x in B so that f(x, y, z) is constant on B \ {x, y}. 
Every set S of points in general position in the plane where no two have the same 
first coordinate defines a happy-end space by f(x, y, z) = | if point z lies above the 
line xy and f(x, y,z) = —1 if point z lies below the line xy. Here a set is convex 
if and only if it consists of vertices of a convex polygon. Alon et al. show that if S 
is a happy-end space with at least R3(n; 8) points, then it contains a convex n-set. 
They also show that for every positive integer n there is a happy-end space on 22") 
points that contains no convex set of points. 
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3 The Erdos Problem on Empty Convex Polygons 


In 1978 Erdés [15] posed a new problem on convex polygons. 


Problem 2. For any positive integer n > 3, determine the smallest positive integer 
H(n), if it exists, such that any finite set X of at least H(n) points in general position 
in the plane contains n points which are the vertices of an empty convex polygon, 
i.e., a polygon whose interior does not contain any point of X. 


Bukh and MatouSek [9] express the suspicion that Problem 2 is a substantially 
harder kind of problem than Problem 1. We will see that known values of N(1) serve 
as tools for bounding corresponding values H(n). 

Trivially, H(3) = 3, and the argument of Esther Klein (see Fig. 1) shows that 
(4) = 5. The next value of H(n) was determined by Harborth [22]. 


Theorem 11 ({22]). H(5) = 10. 


The argument of Harborth is as follows. The inequality H(5) > 10 immediately 
follows from Fig.6, where a set of nine points in general position determines no 
empty convex pentagon (there are still two convex pentagons, neither being empty). 

For the opposite inequality, choose a finite set X of n (= 10) points in general 
position in the plane, then, using Theorem 5, one can find in X,, a vertex set F = 
{P1, P2; P3,P4, Ps} of a convex pentagon P = conv F. Among all such pentagons, 
there is at least one whose interior contains at most one point from X. If no such 
point exists in the interior of P, then F is the desired set. Assuming that the interior 
of P contains exactly one point from X, all other points of X should be outside P. 
A carefully study of possible placements of the latter points reveals the existence 
of five points from X which are the vertices of an empty convex pentagon, which 
results in the inequality H(5) < 10. 

In 1983 Horton [24] showed that H(n) does not exist for all n > 7. Valtr [46] 
gave a simple inductive construction of point sets, called Horton sets, that do not 
contain empty 7-gons. The empty set and a one-point set are Horton sets. A Horton 
set His in general position in the plane, with distinct x-coordinates. Furthermore, H 
can be partitioned into two sets A and B such that 


1. Each of A and B is a Horton set. 


Fig. 6 A set of nine points e 
with no empty convex 
pentagon 


The Erddés-Szekeres Problem 365 


Fig. 7 A Horton set e ‘ e ‘. 


2. The set A is below any line connecting two points of B, and the set B lies above 
any line connecting two points of A. 
3. The x-coordinates of the points of A and B alternate. 


It follows by induction on 7 that H contains no empty 7-gon. If a 7-gon P in H 
contained points from both A and B, then one of |P M A| and |P MN B| would be at 
least four. One could then find a point of H in the interior of P (Fig. 7). 

The existence of H(6) was a major open problem for many years. The lower 
bound, H(6) > 30, was established by Overmars [38], whose computer program 
produced a set of 29 points in the plane with no empty convex 6-gon. The finiteness 
of H(6) was established independently in 2005 by Gerken [19] and Nicolas [37]. 
The proof of [37] shows that H(6) < N(25), while that of [19] gives a better 
inequality H(6) < N(9). 

A more easily readable manuscript of Valtr [48] simplified Gerken’s proof and 
provided a key lemma to describe point sets with no empty 6-gon. A convex h-gon 
H with vertices in X is called minimal if there is no convex h-gon Y, with vertices 
in X, contained in H and distinct from H. Let H be a minimal h-gon in X, and let J 
be the intersection of X with the interior of conv H. Denote by J the intersection of 
X with the interior of conv /. Valtr’s lemma then states that if h > 7 and X contains 
no empty 6-gon, then there are no points of X in the interior of conv J. Combined 
with the bound on N(8) obtained by T6th and Valtr [45], this argument gives the 
inequality H(6) < 463. 

This result was followed by the improvement of Koshelev [31], who showed that 
H(6) < max{N(8), 400}. The proof of Koshelev investigates each of the possible 
triples (h, i,j) of cardinalities of the boundaries of the above sets H,/,J with h = 8, 
and in each case shows that either X has an empty 6-gon or |X| < 400. 

In this regard, we observe that the approach of Szekeres and Peters [43] to prove 
that V(6) = 17 was followed by Koshelev [32]. He proves that every set X of 17 
points contains a convex 6-gon with at most 2 points of X in its interior. He also 
gives an example of a 17-point set X which does not have a convex 6-gon with at 
most one point of X in its interior, and proves that every 18-point set X contains 
a convex 6-gon with at most one point of X in its interior. One can speculate on 
the existence of a number H(n,k) for which every point set X in general position 
in the plane, with at least H(n,k) points, contains the vertex set of a convex n-gon 
with at most k points of X in its interior. Some results for n = 7 were obtained by 
Sendov [40]. 
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4 Higher Dimensional Extensions 


This section deals with a generalization of the Erdés-Szekeres problem (as well as 
the problem on empty convex polygons) for higher dimensions. 


4.1 Convex Polytopes 


An observation that Problem | can be generalized for higher dimensions was 
already mentioned by its authors (see [16]) and later rediscovered by Griinbaum [20, 
pp. 22-23]. We recall that a set X of points in the Euclidean space E¢ is in general 
position if no d + 1 points of X lie in a hyperplane. Furthermore, X is said to be 
convexly independent if no point of X lies in the convex hull of the remaining points. 
In other words, a set X in general position is convexly independent provided it is the 
vertex set of a convex d-polytope in E?. 

Following Griinbaum [20], we define Ny(n), n > d+ 1, as the smallest positive 
integer such that any set of N,(n) points in general position in E4 contains a subset 
of n convexly independent points. Similarly to the planar case, one can pose the 
following two questions. 


1. Do the numbers N,(n) exist for all n > d+ 1? 
2. If yes, what are the values of Na(n)? 


The theorem below shows the existence of N4(n), establishing an upper bound in 
terms of Ramsey numbers. It is a direct generalization of the proof of [16] for the 
planar case. 


Theorem 12. For any positive integer n > d + 1, the number Nqa(n) exists and 
Na(n) < Ra+o(n, d+ 3). 


Proof. We observe first that any set Z C E¢ of d + 3 points in general position 
contains a subset of d+ 2 convexly independent points. Indeed, consider the convex 
hull P of Z and denote by V the vertex set of P. Clearly, V C Z. Since the case 
|V| => d+ 2 is obvious, we may assume that |V| = d + 1, that is, P is a d-simplex. 
Let u, v be the points from Z \ V. The line / through u, v meets at most two facets of 
P. If Q is a facet of P disjoint from J, then the union of {u, v} and the d vertices of 
Q form a desired convexly independent subset of Z of cardinality d + 2. 

Next, we state that a finite set F C E“ in general position is convexly independent 
if and only if each of its subsets of d + 2 points is convexly independent. The “if 
part” is trivial, so it suffices to prove the “only if” part. Indeed, if F is not convexly 
independent, then p € conv(F \ {p}) for a certain point p € F. According to 
Carathéodory’s theorem [10], F \ {p} contains a subset G of d + 1 points such 
that p € conv G. Consequently, {p} U G is a convexly dependent subset of F of 
cardinality d + 2. 
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Finally, let X C E‘ be a set in general position consisting of Ra42(n,d + 3) 
points. Color the (d + 2)-element subsets of X red if the points are convexly 
independent, and color them blue otherwise. The above argument shows that it is 
impossible for all of the (d + 2)-element subsets of a (d + 3)-element subset of X 
to be blue. Hence it must be true that X contains an n-element subset Y for which 
all (d + 2)-element subsets are red. Therefore Y is convexly independent. Hence 
Na(n) < Rat2(n,d + 3). Oo 


Tarsy’s Ramsey-theoretic proof from the planar case was generalized in The- 
orem 9.4.7 of [5]. If a point set X C E? in general position contains R z+ (n, 1) 
points, then X contains the n vertices of a cyclic d-polytope. Thus we know that X 
not only contains a convexly independent set of n points, but the combinatorial type 
of the convex hull of this point set is prescribed. Thus one can define a function 
N@(n) which is the smallest positive integer such that any set of Nj“ (n) points in 
general position in E4 contains the vertex set of a cyclic polytope with n vertices. 
Clearly, N“(n) => Na(n) and N5*°(n) = N(n). The theorem of [5] also states that 
if a combinatorial type of polytopes has the property that a large enough point set in 
general position must contain it, then it is a cyclic polytope. 

Johnson [25] also pointed out that his Ramsey-theoretic argument for the planar 
case can be generalized to higher dimensions. 

Valtr [47] gave another approach for establishing the existence of Na(n). He 
considers any set X of at least N2(n) points in general position in E% and its 
projection Y onto a two-dimensional plane L C E® such that Y is in general 
position in L. Since |Y¥| > N2(n), one can choose in Y a subset of n convexly 
independent points. It is easily seen that the prototypes of these points in X are 
convexly independent. This argument implies the inequality Nz(n) < N2(n), d => 2. 

A similar consideration is true for the case of projections of X on m-dimensional 
planes in E“, 2 < m < d. Using Theorem 4 and Valtr’s argument, one obtains the 
estimates 


2n—5 
Na(n) = Nei) S++ S Nan) < ( : 5) +1. 
n—- 
Another upper bound on Na(n) was obtained by K4rolyi [28], as shown in the 
theorem below. 


Theorem 13 ([28]). [fn > d => 3, then Na(n) < Ng-\(n — 1) + 1. Consequently, 
Na(n) < No(n—d+2)+d—2. 


Proof, Let X C E“ bea set of Ny_;(n — 1) + 1 points in general position. Choose a 
vertex p of conv X, and denote by H a hyperplane strictly separating p from X \ {p}. 
Let Y = {[p,u] 1H: u € X \ {p}}. It is easy to see that the set Y is in general 
position in H, and the mapping g : X \ {p} — Y defined by g(u) = [p,u] NHisa 
bijection. Since H can be identified with E*~!, and since |Y| > Ny-1(n — 1), there 
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is a convexly independent subset Z of Y of size n — 1. Clearly, the set {p} U g!(Z) 
is a convexly independent subset of X of size n. Oo 


A combination of Theorems 4 and 13 results in the following inequality. 


Corollary 2. ifn > d > 2, then Na(n) < ?" 4!) +d-1. 


In contrast to what is known about the planar case, it is not known if the function 
Na(n) is exponential for any fixed d > 2. The only known general lower bound for 
Na(n) is due to Karolyi and Valtr [29], given in Theorem 14 below. 


Theorem 14 ([(29]). For all integers N > d > 2, there exists a configuration of 
N points in general position in E4 which contains at most c’ (log N)*! convexly 
independent points, where c!, is a constant depending on d. Equivalently, there is a 
constant cq > 1 such that 


Na(n) _ ae”). 


Sketch of the proof of Theorem 14. A set X C E* is said to be in strongly general 
position if it is in general position and, for every f = 1,2,...,e —1, any f + 1 
points of X determine an f-dimensional affine subspace which is not parallel to the 
(e — f)-dimensional subspace of E* spanned by its last e —f coordinate axes, e < d. 
Denote by mc(X) the maximum size of a convexly independent subset of X. 


Next, we say that two finite sets in general position and of equal size, have 
the same order type if there is a one-to-one correspondence between them which 
preserves the orientation of each (e + 1)-tuple. It is clear that small perturbations 
of the sets do not affect the order type. More precisely, it is possible to show that 
for every finite set X = {pi,p2,...,p1} in general position in E’, e < d, there is a 
largest scalar 6 = 5,(X) > 0 such that whenever Y = {q1, q2,....q:} C E* satisfies 
lp; — q)| < 6 for all 1 < i< tand 1 <j < e, then ¥ also is in general position and 
has the same order type as X. (Here p; = (p}, p?,...,p') and qi = (q},.q7,....q")) 
In particular, X is convexly independent if and only if Y is. 

For a set X = {p1,..., pr}, let 


€-(X) = min{d;(m/(X)): 1 < f < e}, 


where zz is the orthogonal projection of E¢ onto E’, and d¢(z,(X)) is defined as 
above. Given a scalar 0 < € < €,(X), chose for every point p € X a vector v(p) = 


(v!(p),..., v°(p)) such that 
0 <vl(—p) <--- < v°(p) and v/(p) < ev/t!(p) forall 1 <f <e. 
These vectors v(p1),..., u(p;) can be chosen in such a way that the set X’ = {p + 


v(p) : p € X} of size 2|X| is in strongly general position; in the latter case X’ is 
called an €-double of X. Important properties of e-doubles: 
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1. If X’ is an €-double of X, then z,(X’) is an €-double of z,/(X) for all 1 <f <e, 
2. If X is in strongly general position and 0 < € < €,(X) is small enough, then 


me(X’) < me(X) + me(m,-1(X)). 


Finally, a desired set is constructed inductively, starting with a singleton Xp C E?. 
If for some i > 0, a set X; of points in strongly general position is selected, then a 
very small €; > O and an e;-double X} of X; can be chosen so that 1e(X!) is an 
€;-double of ,(X;) for all 1 < e < d and 


me(z,(X")) < me(z,-(X;)) + me(z,-1(X)), 2<e <d. 
Based on the latter inequality, a double induction on e and i gives 
me(z(X;)) < 2i°! forall 1<e<dandi>l. O 
As shown in [28], a more careful calculation yields that 
me(-(Xi)) < (2/(e— 1)! + OG). 


Thus, for large n and N, Theorem 14 is valid with cq ~ 2°3"4 and cl, = 2/(d—1)!. 

There are quite few cases when the exact values of Na(7) is known. For small 
values of n > d+ 1, we can mention the following equalities, derived by Morris and 
Soltan [36] from a stronger result of Bisztriczky and Soltan [4] (see Theorem 16 
below): 


Nin) = 2n—d—1 forall 2<d and d+2<n< |3d/2| +1. 


These equalities show that Ny(n) behaves as a linear function with respect to n for 
small values of n. 

For n > |3d/2| + 1 and d > 3, there is only one known value: N3(6) = 9, due 
to Bisztriczky and Soltan [4] (see Theorem 18 below). 

Griinbaum [20, pp. 22-23] discussed a variant of the Erdés-Szekeres problem 
in higher dimensions. Namely, he established the existence of a minimum number 
Ba(n), d > 2 andn > d +1, such that any set X C E? of at least By(n) points 
in general position contains a subset of n points lying on the boundary of a convex 
body. His proof is based on Ramsey’s theorem and the following assertion: a finite 
set Y C E@ lies on the boundary of a convex body in E¢ if and only if each of its 
subsets of at most 2d + 1 points lies on the boundary of a convex body in E%. The 
last statement is a direct consequence of the Steinitz theorem (see [41, Sect. 10]): a 
point p € E% belongs to the interior of the convex hull of a set S C E‘ if and only if p 
belongs to the interior of the convex hull of at most 2d points of S. Later Bisztriczky 
and Soltan [4] showed that in the definition of By(n) the set X C E¢ can be arbitrary 


370 W. Morris and V. Soltan 


(not necessarily in general position); they also proved the equality By(n) = Ny(n) 
for all d > 2 andn > d + 1, based on the simple argument that any finite set in E¢ 
can be approximated by a set in general position. 


4.2 Empty Convex Polytopes 


Generalizing the Erdés problem on empty convex polygons (see Sect.3), Bisz- 
triczky and Soltan [4] defined Hz(n) as the smallest positive integer, if it exists, such 
that any set X of at least H,(n) points in general position in E% contains a subset 
of n points that are the vertices of an empty convex polytope, that is, of a polytope 
whose interior does not contain any point of X. We observe that in this definition one 
may consider only sets X of precisely Ha(n) points. Indeed, if X contains more than 
H,(n) points, we can partition X into subset Y and Z by a suitable hyperplane such 
that |Y| = H,(n). Then, if some n points from Y form an empty convex polytope 
with respect to Y, then this polytope also is empty with respect to X. 

The following theorem of Valtr [47] (formulated in slightly distinct terms) shows 
the existence of numbers H,(n) for alld + 1<n< 2d+1. 


Theorem 15. Ha(2d + 1) < Na(4d + 1) forall d> 2. 


Proof. Let X C E® be a set of Nj(4d + 1) points in general position. Choose a 
subset Y C X of 4d + 1 convexly independent points such that the interior of the 
convex polytope P = conv Y contains the smallest number of points from X. Let 
Z=XNintPandk = |Z|. 

If k < d, then we consider a hyperplane H containing Z. Since the set X \ Z 
contains at least 4d — k + | points and lies in the union of two open halfspaces 
determined by H, at least one of these halfspaces contains a subset V of X \ Z 
consisting of at least [(4d — k + 1)/2] points. It is easy to see that the set V U Z is 
convexly independent, contains at least [(4d —k + 1)/2] +k = 2d + 1 points, and 
is the vertex set of an empty polytope. 

Suppose that k > d+ 1 and let Q = conv Z. Choose a facet F of Q and consider 
the set Yo of all points of Y which belong to the open halfspace C determined by the 
hyperplane H containing F but not Q. We state that | Yo| > d+ 1. Indeed, assume 
for a moment that |Yo| = m < d. If Gis a subset of FM Z of m points, then the 
set Y’ = GU(Y NC’), where C’ is the second open halfspace determined by H, 
consists of 4d + 1 convexly independent points such that the interior of the polytope 
conv Y’ contains less then k points of X. Since the latter contradicts the choice of Y, 
we conclude that |Yo| > d+ 1. This argument shows that Yo U (F'N Z) is a convexly 
independent set of at least 2d + 1 points which form an empty convex polytope. O 


Using an n-dimensional version of Horton sets (see [24]), Valtr [47] gave an 
example of an arbitrary large set X C E* in general position containing no 
convexly independent subset of size 24“! (r2r3 --- rqg-+ 1) which determines an empty 
convex polytope with respect to X (here 7.2 = 2,73 = 3,r4 = 5,... is the 
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sequence of prime numbers). Consequently, the numbers H,(n) do not exist for all 
n> 21 (ror3++-ra +1). If d = 3, then the latter statement can be improved: H3(n) 
do not exist for all n > 23 (instead of n > 27(2-3+ 1) = 28 due to the above 
argument). 

For small values of n, Bisztriczky and Soltan [4] determined a sharp upper bound 
on H,(n), formulated here in slightly distinct terms. 


Theorem 16 ([4]). If d > 2and d+1 <n < |3d/2| +1, then Hy(n) < 2n—d-1. 


Proof. Let m = 2n — d—1 and X C E‘ bea set of m points in general position. 
Set P = conv X. Denote by v the number of vertices of the polytope P. Since the 
case v = m is obvious, we assume that d+ 1 < v < m. Thenv = d +t, where 
1 < t < 2n — 2d — 1. Consequently, int P contains m—v = 2n—2d—t—1 
points from X. Choose any 3d — 2n + t + 1 vertices of P (which is possible due to 
n < |3d/2| + 1), and let H be the hyperplane through the selected 


(Qn —2d —t—1) + Bd—-In+t+l=d 


points from X. Denote by Y the set of these d vertices. Clearly, H contains no other 
point of X. Consequently, Y = HM X and the remaining 


v—(d—2n+t+ 1) =2n—-2d-1 


vertices of P lie in E¢ \ H. Hence there is a subset Z of at least n — d vertices of 
P which belong to an open halfplane determined by H. It is easy to see that the 
n-element set Y U Z is the vertex set of an empty convex polytope. Oo 


Later Bisztriczky and Harborth [3] proved the opposite inequality. 


Theorem 17 ((3]). Jf Ha(n) exists, then Ha(n) > 2n — d —1, where d > 2 and 
n>=d+l. 


Proof. Since Hy(d + 1) = d+ 1 and Hy(d + 2) = d+ 3, we may assume that 
n>d+3.LetX C E! be aset in general position, and 1 < t < |X| be an integer. 
We will say that X is t-fat provided 

A (int(conv Y) : Y CX, |¥| = |X|—1) £9. 


Let L,(t) denote the smallest integer / such that there exists a t-fat set X C E? of 
cardinality 7. As shown in [4], 


La(t) = d+2t+1 whenever d>2 and t> 1. 


Put t = n—d—2 and choose a t-fat set X C E” of cardinality d+ 2t+ 1. Then there 
is a point 


p€U=N(int(convY): Y CX, |¥| =d+r+4 1). 
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Since U is an open set, we may choose p so that the set X* = X U {p} is in general 
position. Then |X*| = d+ 2t+ 2 and p belongs to the interior of every convex 
polytope conv Y, where Y C X* and |Y| = d+t+2 =n. Hence no subset of n 
points from X* forms an empty convex polytope. Consequently, 


Ha(n) > |X*|4+1=d+2t+3=2n-—d-1. O 
Combining Theorems 16 and 17, we obtain the following equalities: 
Ha(n) = 2n—d—1 forall d>2 and d+1<n< |3d/2| +1. 


For d > 2 andn > |3d/2]| + 1, only one value of Hz(n) is known: H3(6) = 9, 
as shown in the theorem below. 
Theorem 18 ([4]). N3(6) = H3(6) = 9. 


Proof. Due to the obvious inequality N3(6) < H3(6), it suffices to show that 9 < 
N3(6) and H3(6) < 9. 


Case 1. 9 <N;(6). For this, let 
Pi = (—64,0,0), po=(0, 64,0), p3=64,0,0), ps=(0, 0, 128), 


ps = (—#.8+5,24-4), po=(- 3.8+ §,24- 8).pr=(16. 2, 4), 


and pg = (0,32(1 — €), 64(1 — €)), where € is an arbitrary small real number. 
It is a matter of computation to verify that the set X = {p1,po,...,ps} is in 
general position and has the following properties: 


. conv X = conv {p1, p2, p3, ps}, 

. {ps, Po} C conv {p1, 3, Pa, P7} M conv {p1, P3, Pa, Ps} 

. ps € conv {p2, P3, Pa, Po} N conv {p3, pa, Po, P7}, 

. Po € conv {p1, Po, Ps, ps} NM conv {p1, 3, Ps, ps} N conv {p1, pa, ps, pr}, 

. P7 © conv {p1, Po, P3, ps} MN conv {po, p3, Pa, Po} N conv {p2, p3, Ps, Po}, 

. Pg € conv {p2, Pa, Ps, 7} N conv {po, pa, Po, P7}- 

Consequently, no six-element subset of X is the vertex set of an empty convex 
polytope. 

Case2. H3(6) < 9. Let X = {p,p2,...,po} be any set of nine points in general 
position in E*. Put P = conv X, and denote by Y the vertex set of P. Without loss 
of generality, we assume that Y = {p,,p,...,p,}, where 4 < r < 9. Since the 
case r = 9 is obvious (any six points from Y are convexly independent), we let 
r <8. Put Z =X \ Y and T = convZ. 


Nn BWNY 


a) Ifr > 6, then we argue as in the proof of Theorem 16 and obtain the existence 
of an empty polytope with six vertices from X. 
b) Let r = 5. Then |Z| = 4 and T is a 3-simplex. Obviously, T C int P. 
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Suppose first that the line = (p;,p;) meets intT ¢ @ for some p; € X and 
p; € Y. Let! = (ps, po). Then / meets the relative interior of conv {p7, pg, Po}- 
Denote by L; the closed halfplane bounded by / and containing p;, i = 7, 8, 9. 
It is clear that E? \ (L7 U Lg U Lo) is the union of three open convex regions, 
A,,A2,A3, one of them containing at least two points from {p1, p2,p3, p4}- 
Assume that namely A; is bounded by L7, Lg and contains two points, say 
P1,P2, from {pj, p2, p3, pas. Then, as easy to see, the set {p1, p2, Ps, Po. P7, Ps} 
generates an empty polytope with six vertices. 

Suppose now that 


(pi, pj) NintT = @ forall p; € X and p; € Y. (1) 


Translating X on a suitable vector, we may assume that pg is the origin of E’. 
Since X is in general position, {p7, pg, po} is a basis for E>. Choosing a new 
coordinate system, we let p7 = (1,0,0), pg = (0,1,0), and po = (0,0, 1). 
The three coordinate planes decompose E? into eight pairwise disjoint open 
octants O(a), 2,03), where a; = +1 is the sign of the ith coordinate of a 
point in O(a, a2, a3). 

Next, let H denote the plane through {p7, ps, po}; also, H; and H> be the 
opposite open halfspace determined by H such that ps € H7. It is a direct 
consequence of (1) that 


cn 60. U0G. = UGG), 
XA COS) UO 4=)00G =). 
Put 
O(a), 02,03) = HK O(a), 02,03), k = 1,2. 
Then Y lies in the union of six regions, 


o'(+,+,-), o'(+,-, +), Ol(-, +, +), 
Q’(-,-, +), Q’(-,+,-), Q’(+,-,-). 
If one of these six regions contains two points, say p; and p2, from {p1, p2, p3}, 


then conv({p1, p2}UZ) is an empty convex polytope with six vertices. Assume 
this is not the case, and let, for instance, 


pie O(+.4.—-), po € O(-,+.-), p3 € O'(4,-.-). 


In this case, the plane H strictly separates {po} and {p1, p2, p3}, which shows 
that the convex polytope conv {p1, P2, 23, Po, P7, Ps} is empty. 
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c) Let r = 4. Then |Z| = 5. If the polytope T = conv Z has five vertices, then it 
is the union of two non-overlapping simplices 


conv {p5,P7,Ps,po} and conv {p6, p7. ps, Po}. 


Then the line 1 = (ps,po) meets the relative interior of the triangle 
conv {p7, ps, Po}. 

If T has four vertices, then it is a simplex. In this case, we may assume ps € 
int T and whence the line / = (ps, p¢) still has the above property. In either 
case, we define the halfplanes L;, i = 7, 8, 9, and argue as in part b). oO 


Since the number H3(6) can be viewed as a particular case of Hy(|3d/2]| + 2), 


we pose the question on the values of Hy(|3d/2| + 2) for all d > 4. 
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Novikov’s Conjecture 


Jonathan Rosenberg 


Abstract We describe Novikov’s “higher signature conjecture,” which dates back 
to the late 1960s, as well as many alternative formulations and related problems. 
The Novikov Conjecture is perhaps the most important unsolved problem in 
high-dimensional manifold topology, but more importantly, variants and analogues 
permeate many other areas of mathematics, from geometry to operator algebras to 
representation theory. 


1 Origins of the Original Conjecture 


The Novikov Conjecture is perhaps the most important unsolved problem in the 
topology of high-dimensional manifolds. It was first stated by Sergei Novikov, in 
various forms, in his lectures at the International Congresses of Mathematicians in 
Moscow in 1966 and in Nice in 1970, and in a few other papers [85-88]. For an 
annotated version of the original formulation, in both Russian and English, we refer 
the reader to [37]. Here we will try instead to put the problem in context and explain 
why it might be of interest to the average mathematician. For a nice book-length 
exposition of this subject, we recommend [66]. Many treatments of various aspects 
of the problem can also be found in the many papers in the collections [38, 39]. 

For the typical mathematician, the most important topological spaces are smooth 
manifolds, which were introduced by Riemann in the 1850s. However, it took about 
100 years for the tools for classifying manifolds (except in dimension 1, which is 
trivial, and dimension 2, which is relatively easy) to be developed. The problem is 
that manifolds have no local invariants (except for the dimension); all manifolds of 
the same dimension look the same Jocally. Certainly many different manifolds were 
known, but how can one tell whether or not the known examples are “typical”? How 
can one distinguish one manifold from another? 
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With big leaps forward in topology in the 1950s, it finally became possible to 
answer these questions, at least in part. Here were a few critical ingredients: 


1. the development of the theory of Reidemeister and Whitehead torsion and the 
related notion of “simple homotopy equivalence” (see [77] for a good survey of 
all of this); 

2. the theory of characteristic classes of vector bundles, developed by Chern, Weil, 
Pontrjagin, and others; 

3. the notion of cobordism, introduced by Thom [112], who also provided a method 
for computing it; 

4. the Hirzebruch signature theorem sign(M) = (2(M),[M]) [54], giving a 
formula for the signature of an oriented closed manifold M“** (this is the algebraic 
signature of the nondegenerate symmetric bilinear form (x, y) +> (x U y, [M]) on 
H*« coming from Poincaré duality), in terms of a certain polynomial “(M) in 
the rational Pontrjagin classes of the tangent bundle. 


Using just these ingredients, Milnor [74] was able to show that there are at 
least 7 different diffeomorphism classes of 7-manifolds homotopy equivalent to S’. 
(Actually there are 28 diffeomorphism classes of such manifolds, as Milnor and 
Kervaire [65] showed a bit later.) This and the major role played by items 2 and 
4 on the above list! came as a big surprise, and showed that the classification of 
manifolds, even within a “standard” homotopy type, has to be a hard problem. 

The final two ingredients came just a bit later. One was Smale’s famous 
h-cobordism theorem, which was the main ingredient in his proof [109] of the 
high-dimensional Poincaré conjecture in the topological category. (In other words, 
if M” is a smooth compact n-manifold, n > 5, homotopy equivalent to S”, then 
M is homeomorphic to S$”, even though it may not be diffeomorphic to it.) But 
from the point of view of the general manifold classification program, Smale’s 
important contribution was a criterion for telling when two manifolds really are 
diffeomorphic to one another. An h-cobordism between compact manifolds M and 
M’ is acompact manifold with boundary W, such that 0W = MUM’ and such that W 
has deformation retractions down to both M and M’. The h-cobordism theorem [76] 
says that if dim M = dim M’ > 5 and if M, M’, and W are simply connected, then W 
is diffeomorphic to M x [0, 1], and in particular, M and M’ are diffeomorphic. The 
advantage of this is that diffeomorphisms between different manifolds are usually 
very hard to construct directly; it is much easier to construct an h-cobordism. 

If one dispenses with simple connectivity, then an h-cobordism between M and 
M’ need not be diffeomorphic to a product M x [0, 1]. However, the s-cobordism 
theorem, due to Barden, Mazur, and Stallings, with simplifications due to Kervaire 


‘Spheres have stably trivial tangent bundle and no interesting cohomology, so one’s first guess 
might be that the theory of vector bundles and the signature theorem might be irrelevant to studying 
homotopy spheres. Milnor, however, showed that one can construct lots of manifolds with the 
homotopy type of a 7-sphere as unit sphere bundles in rank-4 vector bundles over S*. He also 
showed that the signature of an 8-manifold bounded by such a manifold yields lots of information 
about the homotopy sphere. 
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[64], says that the h-cobordisms themselves are classifiable by the Whitehead 
torsion t(W,M), which takes values in the Whitehead group Wh(z), where 
mt = 7\(M), and all values in Wh(zr) can be realized by h-cobordisms. (The 
Whitehead group is the quotient of the algebraic K-group K,(Zz) by its “obvious” 
subgroup {+1} x zp.) Thus an h-cobordism is a product if Wh(z1) = 0, which is 
the case for z free abelian, and in fact is conjectured to be the case if z is torsion- 
free. But for z finite, for example, Wh(z) is a finitely generated group of rank 
r — q, where r is the number of irreducible real representations of z, and q is the 
number of irreducible rational representations of a [77, Theorem 6.2]. This number 
r — q is usually positive (for example, when z is finite cyclic, it vanishes only if 
|r| = 1,2,3,4, or 6). Bass and Murthy have even shown [8] that there are finitely 
generated abelian groups z for which Wh(z:) is not finitely generated. 

The last major ingredient for the classification of manifolds is the method of 
surgery. Surgery on an n-manifold M” means cutting out a neighborhood S* x D’~* 
of a k-sphere S‘ <> M (with trivial normal bundle) and replacing it by D‘*! x 
5”-k-|, which has the same boundary. This can be used to modify a manifold 
without changing its bordism class, and was first introduced by Milnor [75] and 
Wallace [117]. 

With the help of all of these techniques, Browder [20, 21] and Novikov 
[81, 82] finally introduced a general methodology for classifying manifolds in high 
dimensions. The method gave complete results for simply connected manifolds 
in dimensions > 5, and only partial information in dimensions 3 and 4, which 
have their own peculiarities we won’t discuss here. With the help of additional 
contributions by Sullivan [111], Novikov [86], and above all, Wall [115], this 
method grew into what we know today as surgery theory, codified by Wall in 
his book [116], which originally appeared in 1970. There are now fairly good 
expositions of the theory, for example in Ranicki’s books [94, 95], in the book by 
Kreck and Liick [66], in the first half of Weinberger’s book [119], and in Browder’s 
colloquium lectures from 1977 [22], so we won’t attempt to compete by going into 
details, which anyway would take far too many pages. Instead we will just outline 
enough of the ideas to set the stage for Novikov’s conjecture. 

As we indicated before, surgery theory addresses the uniqueness question for 
manifolds: given (closed and connected, say) manifolds M and M’ of the same 
dimension n, when are they diffeomorphic (or homeomorphic)? It also addresses 
an existence question: given a connected topological space X (say a finite CW 
complex), when is it homotopy equivalent to a (closed) manifold? 

A few necessary conditions are evident from a first course in topology. If M@ 
and M' are diffeomorphic, then certainly they are homotopy equivalent, and so 
they have the same fundamental group z. Furthermore, if a finite connected CW 
complex X has the homotopy type of a closed manifold, then it has to satisfy 
Poincaré duality, even in the strong sense of (possibly twisted) Poincaré duality 
of the universal cover with coefficients in Zx. Homotopy equivalences preserve 
homology and cohomology groups and cup products, so an orientation-preserving 
homotopy equivalence also preserves the signature (in dimensions divisible by 4 
when the signature is defined). However, these conditions are not nearly enough. 
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For one thing, for a homotopy equivalence to be homotopic to a diffeomorphism 
(or even a homeomorphism), it has to be simple, i.e., to have vanishing torsion in 
Wh(zr). Depending on the fundamental group z, this may or may not be a serious 
restriction. 

But the most serious conditions involve characteristic classes of the tangent 
bundle. Via a very ingenious argument using surgery theory and the Hirzebruch 
signature theorem, Novikov [83, 84] showed that the rational Pontrjagin classes 
of the tangent bundle of a manifold are preserved under homeomorphisms.” 
(Incidentally, Gromov [45, Sect. 7] has given a totally different short argument for 
this.) The rational Pontrjagin classes do not have to be preserved under homotopy 
equivalences. So if g: M — M’ is a homotopy equivalence not preserving rational 
Pontrjagin classes, it cannot be homotopic to a homeomorphism. 

In the simply connected case, this is (modulo finite ambiguity) just about all: 
if M’ — M is an orientation-preserving homotopy equivalence of closed simply 
connected oriented manifolds, the rational Pontrjagin classes of M’ have to satisfy 
the constraint (.7(M’), [M"]) = sign(M’) = sign(M) imposed by the Hirzebruch 
signature theorem, but otherwise they are effectively unconstrained (assuming the 
dimension of the manifold is at least 5).> And if the map does preserve rational 
Pontrjagin classes, then there are only finitely many possibilities for M’ up to 
diffeomorphism. 

When M is not simply connected, the situation is appreciably more complicated. 
Suppose one wants to check if two n-manifolds M and M’ are diffeomorphic. As 
we indicated before, that means we need to have a simple homotopy equivalence 
gy: M’ — M. If g were homotopic to a diffeomorphism, it would preserve the 
classes of the tangent bundles, so it’s convenient to assume that g has been promoted 
to anormal map y: (M’,v’) > (M,v). Here v and v’ are the stable normal bundles 
defined via the Whitney embedding theorem: if k is large enough (n + 1 suffices), 
then M and M’ have embeddings into Euclidean space R”**, and any two such 
embeddings are isotopic, so the isomorphism class of the normal bundle v or v’ for 
such an embedding is well defined. (Because of the Thom-Pontrjagin construction, 
it’s better to work with the normal bundle than with the tangent bundle, but they 
contain the same information.) Being a normal map means that g has been extended 
to a bundle map from v’ to v, which we can assume is an isomorphism on fibers. 
The idea of trying to show that M and M’ are diffeomorphic is to start with a normal 
bordism from 9 to idy, i.e., a manifold W"*! with boundary M UM’ and a map 
®: W > M x (0, 1] restricting to g and to idy on the two boundary components, 
and with a compatible map of bundles, and then to try to modify (W, ®) by surgery 


?The same does not hold for the torsion part of the Pontrjagin classes, as one can see from 
calculations with lens spaces [87, Sect. 3]. 

3A precise statement to this effect may be found in [31, Theorem 6.5]. It says for example that if M 
is a closed simply connected manifold and dim M is not divisible by 4, then for any set of elements 
x; € H4(M,Q),1<j< | one | , there is a positive integer R such that for any integer m, there is 
a homotopy equivalence of manifolds ¢,,: M/, — M such that p;(M/,) = 9; (p;(M) + mRx;). 
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to make it into an s-cobordism. Once this is accomplished, then M and M’ are 
diffeomorphic by the s-cobordism theorem. It turns out that doing the surgery is not 
difficult until one gets up to the middle dimension (if n + 1 is even) or the “almost 
middle” dimension | "4*| (if n + 1 is odd). At this point a surgery obstruction 
appears, taking its value in a group L,4 (Zz) constructed purely algebraically out 
of quadratic forms on Zz. (Roughly speaking, the L-groups are groups of stable 
equivalence classes of forms on finitely generated projective or free Z-modules, 
and the type of the form—symmetric, skew-symmetric, etc.—depends only on the 
value of n mod 4. The original construction may be found in [116].) The existence 
problem (telling if one can find a manifold homotopy equivalent to a given finite 
complex with Poincaré duality) works in a very similar way, just down in dimension 
by 1, and the surgery obstruction in that case takes its values in L,(Zz). 

Ultimately, the result of this surgery process is to prove that there is a surgery 
exact sequence for computation of the structure set (M), the set of (simple) 
homotopy equivalences gy: M’ —> M, where M’ is a smooth compact manifold, 
modulo equivalence. We say that two such maps y: M’ > M and yg’: M” > M 
are equivalent if there is a commuting diagram 


Q 
= ey 
yr 


The surgery exact sequence then takes the form 


M’ 


1 Dn (Zt) > S(M) —1> WV (M) “+L, (Zn). (1) 


Here .W (M) is the set of normal invariants, the normal bordism classes of all normal 
maps y: (M’,v') — (M,v) (not necessarily homotopy equivalences as before) 
modulo linear automorphisms of v. This can also be identified with homotopy 
classes of maps from M into a classifying space called G/O. If one works instead 
in the PL or the topological category, the same sequence (1) is valid, but G/O is 
replaced by G/PL or G/Top, which are easier to deal with,* and in fact look a lot 
like BO, the classifying space for real K-theory. The natural maps G/O > G/PL > 
G/Top are rational homotopy equivalences. The map 7: .7(M) > -V(M) sends a 
homotopy equivalence yg: M’ — M to the associated normal data. 

The groups L,(Zzr) are 4-periodic, and only depend on the fundamental group 
and some “decorations” which we are suppressing here, which only affect the 
torsion. The map a: .”(M) — L,(Zz) takes the bordism class of a normal map 
gy: (M’,v') > (M,v) to its associated surgery obstruction. When this vanishes, 
exactness of (1) says we can lift g to an element of .“(M), or in other words, we 


“Once the dimension is bigger than 4! 
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can do surgery to convert it to a homotopy equivalence. The dotted arrow from 
Ln+1(Z) to (M) signifies that the surgery group operates on .“(M) (which is 
just a pointed set, not a group) and that two elements of the structure set have the 
same normal invariant if and only if they lie in the same orbit for the action of 
Lit (Zn ) . 

The exact sequence (1) is closely related to an algebraic surgery exact sequence 


1+ Engi (Zt) > F,(M) > H,(M, LZ) > L,(Z) (2) 


constructed in [93, 95], where the map A, called the assembly map, corresponds to 
local-to-global passage. We will come back to this later. 

For most groups zr, the L-groups L.(Zz) are not easy to calculate, so a lot of 
the literature on surgery theory emphasizes things related to the exact sequence (1) 
which don’t rely on explicit calculation of all the groups. For example, sometimes 
one can compare two related surgery problems, or rely on other invariants, such as 
n- and p-invariants for finite groups. These (as well as direct calculation from (1)) 
show that there are infinitely many manifolds with the homotopy type of RP#*?, 
k > 1. In fact, it’s shown in [27] that in dimension 4k + 3, k => 1, any closed 
manifold M with torsion in its fundamental group has infinitely many distinct 
manifolds simple homotopy-equivalent to it. 

Now we are ready to explain Novikov’s conjecture. We can rewrite the Hirze- 
bruch signature theorem as saying that for a closed connected oriented manifold 
M, the 0-degree component of “(M) OM [M] in Ho(M,Q) & Q coincides with 
sign M, which is preserved by orientation-preserving homotopy equivalences. The 
components of -(M) /M [M] in other degrees have no such invariance property, and 
knowing them is equivalent to knowing the rational Pontrjagin classes. However, 
Novikov discovered in [83] (see [31, Theorem 2.1 and its proof] for a simplified 
version of his argument) that if 71(M) ~ Z, then the degree-1 component of 
-£(M) 2 [M] is also an oriented homotopy invariant. This theorem is the simplest 
special case of Novikov’s conjecture. 


Definition 1.1. Let M be a closed connected oriented manifold, and let z be a 
countable discrete group (usually taken to be the fundamental group of M). Let Bx 
be a classifying space for 2, a CW complex with contractible universal cover and 
fundamental group zr, and let f: M — Bz be a continuous map. (Up to homotopy, 
it’s determined by the induced homomorphism z;(M) — sz.) The associated higher 
signature of M is fx(-2(M) N [M]) € H.(Bz, Q). 


Conjecture 1.2 (Novikov’s Conjecture). Any higher signature f,(2(M) 
[M]) € H.(Bz, Q) is always an oriented homotopy invariant. In other words, 
if M and M' are closed connected oriented manifolds and if 9: M’ > M is 


(continued) 


Novikov’s Conjecture 383 


Conjecture 1.2 (continued) 
an orientation-preserving homotopy equivalence and f: M — Br, then 


fa 2(M) 9 [M]) = (f 0p) x(2(M’) 2 [M']) € H, (Bz, Q). 


The utility of the conjecture can be illustrated by an example. 


Problem 1.3. Classify smooth compact 5-manifolds homotopy equivalent to 
CP? x S'. (Note: the diffeomorphism classification of smooth 4-manifolds 
homotopy equivalent to CP? is not known, since surgery breaks down in the 
smooth category in dimension 4. It is known by work of Freedman [41] that up to 
homeomorphism, there are exactly two closed topological 4-manifolds homotopy 
equivalent to CP’, but for the “exotic” one, the product with S$ ' does not have a 
smooth structure.) 


Proof. Suppose M is a smooth closed manifold of the homotopy type of CP? x S!. 
There is a smooth map f: M — S! inducing an isomorphism on 7, and we can take 
this to be the map f: M — Bux, a = Z, for the case of the conjecture proven by 
Novikov himself. So the conjecture implies that if K = f—! (pt), the inverse image 
of a regular value of f, then K has signature |. This fixes the first Pontrjagin class 
of M. Furthermore, K being a smooth 4-manifold with signature 1, it is in the same 
oriented bordism class as CP?. From this we can get a normal bordism W° between 
M (with its stable normal bundle v) and CP? x S! (with its stable normal bundle €). 
We plug into the surgery machine and try to do surgery to convert this to an h- 
cobordism (and thus automatically an s-cobordism, since Wh(Z) = 0). The surgery 
obstruction lives in L¢(Z[Z]). This group turns out to be Z/2 (coming from the 
image of the Arf invariant in Lg(Z) = Z/2). So there are not a lot of possibilities. 
In fact one can show by studying the continuation of the sequence (1) to the left 
that M is diffeomorphic to CP? x S!. But note that the key ingredient in the whole 
argument is the Novikov Conjecture, which pins down the first Pontrjagin class. O 


2 Methods of Proof 


Work on the Novikov Conjecture began almost as soon as the conjecture was for- 
mulated. Roughly speaking, methods fall into three different categories: topological, 
analytic, and algebraic. The topological approach began with Novikov’s own work 
on the free abelian case of the conjecture, which we already mentioned in the case 
mz = Z, and which only uses transversality and basic homology theory. This method 
was generalized in work of Kasparov, Farrell-Hsiang, and Cappell [23, 33, 58], 
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who used codimension-one splitting methods to deal with free abelian and poly- 
Z groups, and certain kinds of amalgamated free products. 

Subsequent topological approaches to the conjecture have been based on con- 
trolled topology (if you like, a blend of analysis and topology since it amounts 
to topology with é-¢ estimates) or on various methods in stable homotopy theory. 
There is a lot more in this area than we can possibly summarize here, but it is 
discussed in detail in [37], which includes a long bibliography. 

The analytic approach began with the important contribution of Lusztig [72]. 
The key idea here is to realize the higher signature of Definition 1.1 as the index 
of a family of elliptic operators, just as Atiyah and Singer [2, Sect. 6] had reproven 
Hirzebruch’s signature theorem by realizing the signature as the index of a certain 
elliptic operator, now universally called the signature operator. (This is just the 
operator d + d* operating on differential forms, but with a grading on the forms 
coming from the Hodge *-operator.) A major step forward from the work of Lusztig 
came with the work of Mishchenko [78, 79] and Kasparov [57, 61, 62], who realized 
that one could generalize this construction by using “noncommutative” families of 
elliptic operators, based on a C*-algebra completion C* (sr) of the algebraic group 
ring Cz. Underlying this method was the idea [79, 99] that because of the inclusions 
Zn — Ca — C*(z), there is a natural map L,(Z2) —> L,(C*(z)), and that 
because the spectral theorem enables one to diagonalize quadratic forms over a C*- 
algebra, the L-groups and topological K-groups of a C*-algebra essentially coincide. 
As we will see in the next section, the analytic approach to the Novikov conjecture 
is the one that has attracted the most recent attention, though there is still plenty of 
work being done on topological and algebraic methods. 

Algebraic approaches to proving the Novikov conjecture depend on a finer 
understanding of the surgery exact sequence (1) and the L-groups. For a homotopy 
equivalence of manifolds g: M’ — M, the difference y.(2(M’)N[M'])—(LUM) nN 
[M]) € H.(M, Q) is basically n([M’ > M]) @z Q in (1). The Novikov conjecture 
says that this should vanish when we apply f,, f: M — Bz. Since we could also 
apply (1) with M replaced by Bz (at least if Ba can be chosen to be a manifold— 
but there is a way of getting around this), exactness in (1) shows that the Novikov 
Conjecture is equivalent to rational injectivity of the map a in (1), when we replace 
M by Br. 

More precisely, we need to make use of an idea of Quinn [91], that the L-groups 
are the homotopy groups of a spectrum: 


L, (Zr) = Tn (ee (Zzr)) 


and that the map a@ in the surgery exact sequence (1) comes from an assembly map 
which is the induced map on homotopy groups of a map of spectra 


Au: My AL.(Z) > L. (Zz). 


This map factors (via ff: M — Bz) through a similar map 
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Aq: Bry ALe(Z) > L.(Zz). (3) 


If A, in (3) induces a rational injection on homotopy groups, then the Novikov 
Conjecture follows from exactness of (1). On the other hand, if A, is not rationally 
injective, then one can construct an M and a higher signature for it that is not 
homotopy invariant. So the Novikov Conjecture is reduced to a statement which 
at least in principle is purely algebraic, as Ranicki in [93, 95] gives a purely 
algebraic construction of the surgery spectra and of the map A,,, leading to the exact 
sequence (2). 


3 Variations on a Theme 


One of the most interesting features of the Novikov Conjecture is that it is closely 
related to a number of other useful conjectures. Some of these are known to be true, 
some are known to be false, and most are also unsolved. But even the ones that are 
false are false for somewhat subtle reasons, and still carry some “element of truth.” 
Here we mention a number of these related conjectures and something about their 
status. 


Conjecture 3.1 (Borel’s Conjecture). Any two closed aspherical (i.e., hav- 
ing contractible universal covers) manifolds M and M’ with the same 
fundamental group are homeomorphic. In fact, any homotopy equivalence 
gy: M' — M of such manifolds is homotopic to a homeomorphism. 


This conjecture is known to have been posed informally by Armand Borel, 
before the formulation of Novikov’s Conjecture, and was motivated by the Mostow 
Rigidity Theorem. It amounts to a kind of topological rigidity for aspherical mani- 
folds. Note that if M is aspherical with fundamental group z and n = dimM >= 5, 
then we can take M = Brn, and Borel’s conjecture amounts to saying that in the 
surgery sequence (1) in the topological category, .“(M) is just a single point, or 
by exactness, the assembly map A, is an equivalence. This implies the Novikov 
Conjecture for z, but is stronger. 

Incidentally, it is known now that the analogue of Borel’s Conjecture, 
but with homeomorphism replaced by diffeomorphism, is false. The simplest 


5It turns out that (2) coincides with the analogue of (1) in the topological, rather than smooth, 
category, but the difference between these is rather small since all homotopy groups of Top/O are 
torsion. 
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counterexample is with M = 7’, the 7-torus. Since a torus is parallelizable, Wall 
pointed out in [116, Sect. 15A] that the set of smooth structures on T” compatible 
with the standard PL structure is parameterized by [T”, PL/O] (for n > 5). It is 
known that the classifying space PL/O is 6-connected and that (for j > 7) its 
jth homotopy group can be identified with the group ©; of smooth homotopy j- 
spheres.° Since ©; = Z/28 by [65, 74], the differentiable structures on 7’ are 
parameterized by [77, PL/O] = [T’, K(Q7,7)] = H’(T’,Q7) = Z/28 and there 
are 28 different differentiable structures on 7’. A series of counterexamples with 
negative curvature to the smooth Borel conjecture was constructed in [34, 35]. 

The fundamental group z of an aspherical manifold M (even if noncompact) has 
to be torsion-free, since if g € mz has finite order k > 1, it would act freely on 
the universal cover M, and M /(g) would be a finite-dimensional model for BZ/k, 
contradicting the fact that Z/k has homology in all positive odd dimensions. So 
Conjecture 3.1 can’t apply to groups with torsion. In fact, the result of [27] shows 
that for groups with torsion, A, in (3) is never an equivalence. We will come back 
to this shortly. 

However, we have already mentioned the role of the Whitehead group, which 
comes from the algebraic K-theory of Zz, in studying manifolds with fundamental 
group z. An important conjecture which we have already mentioned is: 


Conjecture 3.2 (Vanishing of Whitehead Groups). /f z is torsion-free, 
then Wh(zc) = 0. 


Note that if Conjecture 3.2 fails and z is the fundamental group of a closed 
manifold M, then by the s-cobordism theorem, there is an h-cobordism W with 
dW = MU (—M’) which is not a product, and we have a homotopy equivalence 
M’ — M which is not simple, hence Borel’s Conjecture, Conjecture 3.1, fails for M. 

More generally, one can ask what one can say about the algebraic K-theory of Zz 
in all degrees. Loday [69] constructed an assembly map Ba A K(Z) > K(Zz), 
and this being an equivalence would say that all of the algebraic K-theory of Za 
comes in some sense from homology of z and K-theory of Z. This is known in some 
cases—for z free abelian, it follows from the “Fundamental Theorem of K-theory.” 
The assembly map being an equivalence in degrees < 1 for torsion-free groups 
xz and R = Z implies Conjecture 3.2. The analogue of Novikov’s Conjecture for 
K-theory is 


The group operation is the connected sum; inversion comes from reversing the orientation. 
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Conjecture 3.3 (Novikov Conjecture for K-Theory). Let R=Z,Q, 
R,or C and let a be a discrete group. Then the assembly map 
Brus A K(R) — K(Rx) induces an injection of rational homotopy 
groups. 


Conjecture 3.3 was proved (with R = Z, the most important case) for groups 
with finitely generated homology in [16]. It was also proved (without rationalizing) 
in [25], when z is a discrete, cocompact, torsion-free discrete subgroup of a 
connected Lie group. Subsequently, Carlsson and Pedersen [26] proved it (without 
rationalizing) for any group a for which there is a finite model for Bz, such 
that the universal cover Ex of Bx admits a contractible metrizable m-equivariant 
compactification X such that compact subsets of Ex become small near the 
“boundary” X ~ Ez. This was recently improved [92] to the case where there is a 
finite model for Bz and z has finite decomposition complexity, which is a tameness 
condition on z viewed as a metric space with the word length metric (for some finite 
generating set). 

As we have already mentioned, for groups with torsion, the assembly map 
A, of (3) is never an equivalence. For similar reasons, one also can’t expect the 
K-theory assembly map to be an equivalence for groups with torsion. The correct 
replacement seems to be the following.’ 


Conjecture 3.4 (Farrell-Jones Conjecture). Let 2 be a discrete group and 
let F be its family of virtually cyclic subgroups (subgroups that contain a 
cyclic subgroup of finite index). Such subgroups are either finite or else admit 
a surjection with finite kernel onto either Z or the infinite dihedral group 
(Z/2) * (Z/2). Let Ez (x) denote the universal m-space with isotropy in F . 
This is a contractible m-CW-complex X with all isotropy groups in ¥ (for 
the m-action) and with X" contractible for each H € F. It is known to be 
uniquely defined up to 1-homotopy equivalence. Then the assembly maps 


Hy (Eg(x);L(Z)) > L(Zx) and Hy (E(x); K(R)) > K(Rr) A) 


are isomorphisms for R = Z, Q, R, or C. 


7Just for the experts: one needs to use the —oo decoration on the L-spectra here. 
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When mz is torsion-free, (4) is just the assembly map (3) or its K-theory version, 
and the conjecture says that the assembly map is an equivalence. Conjecture 3.4 
implies Conjectures 3.1, 1.2, and 3.3, even for groups with torsion, as well as 
Conjecture 3.2. More details on Conjecture 3.4 may be found in [70], in [66, 
Chaps. 19-24], or in [71]. The K-theory version of the conjecture has been proven 
in [7] for fundamental groups of manifolds of negative curvature and in [6] for 
hyperbolic groups, and both the K-theory and L-theory versions have been proven 
for certain groups acting on trees in [5, 107] and for cocompact lattice subgroups of 
Lie groups in [4]. Split injectivity of (4) has been proved for groups with finite 
quotient finite decomposition complexity (a condition weaker than that of [92]) 
in [63]. Rational injectivity of (4) holds under much weaker conditions; see for 
example [30]. 

Another variation on the Novikov Conjecture is to consider the situation where a 
finite group G acts on a manifold, and one wants to study G-equivariant invariants 
of M. Under suitable circumstances, one finds that the fundamental group of M 
leads to a certain extra amount of equivariant topological rigidity. To formulate 
the analogue of Conjecture 1.2, one needs a substitute for the homology L-class 
-£(M) 2 [M]. The easiest way to formulate this is in K-homology, since Kasparov 
[59, 60], following ideas of Atiyah and Singer, showed that an elliptic differential 
operator D on M naturally leads to a K-homology class [D] € Ke(M) (see also [50] 
for an exposition), and when D is G-invariant, the class naturally lives in K(M). 
The image of [D] in K°(pt) = R(G) under the map induced by M — pt is the 
equivariant index indg D € R(G) in the sense of Atiyah and Singer. When D is 
the signature operator, “(M) M [M] is basically (except for some powers of 2, not 
important here) the Chern character of [D] € K.(M), and so iff: M — Bz, the 
higher signature of Definition 1.1, is basically the Chern character of f,([D]). That 
motivates the following. 


Conjecture 3.5 (Equivariant Novikov Conjecture [105]). Let M be a 
closed oriented manifold admitting an action of a finite group G, and suppose 
f: M = X is a G-equivariant smooth map to a finite G-CW complex which 
is G-equivariantly aspherical (i.e., X" is aspherical for all subgroups H of 
G). Let py: M’ — M be a G-equivariant map of closed G-manifolds which, 
non-equivariantly, is a homotopy equivalence. Then if [Dy and |Dy’| denote 
the equivariant K-homology classes of the signature operators on M and M’, 
respectively, 


fe(Dul) = (f° 9)*((Dw]) € KP). 
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Various generalizations and applications to rigidity theorems are possible (see for 
example [36, 104]), but we won’t go into details here. Conjecture 3.4 was proven in 
[105] for X a closed manifold of nonpositive curvature and in [43] for X a Euclidean 
building, in both cases with G acting by isometries. 


4 New Directions 


The conjectures we discussed in Sect.3 are fairly directly linked to the original 
Novikov Conjecture, and it is easy to see how they are connected with topological 
rigidity of highly connected manifolds. But in this section, we will discuss a number 
of other conjectures which grew out of work on Novikov’s Conjecture but which 
go somewhat further afield, to the point where the connection with the original 
conjecture may not be immediately obvious. However, we will try to explain the 
relationships as we go along. 

We have already mentioned the assembly map and the Farrell-Jones Conjecture 
(Conjecture 3.4), which gives a conjectural calculation of the L-groups L.(Zz) for 
a discrete group 2. However, work on Novikov’s Conjecture by analytic techniques 
(see Sect. 2) already required passing from the integral group ring to the complex 
group ring (this only affects 2-torsion in the L-groups) and then completing Cz toa 
C*-algebra. For C*-algebras, L-theory is basically the same as topological K-theory, 
and even for real C*-algebras, they agree after inverting 2 [99, Theorem 1.11]. 
So it’s natural to ask if assembly can be used to compute the topological K- 
theory of C*(z). For the full group C*-algebra this seems to be impossible, but 
for the reduced group C*-algebra C*(zr),® the completion of Cz for its action on 
L(x), there is a good guess for a purely topological calculation of K.(C*(z)). 
(Here K, denotes topological K-theory for Banach algebras, which satisfies Bott 
periodicity. This is much more closely related to L-theory, which is 4-periodic, 
than is algebraic K-theory in the sense of Quillen.) This guess is given by the 
Baum-Connes Conjecture, originally formulated in [9, 10] and further refined in 
[11] (see also [47] for a nice quick survey). The conjecture applies to far more 
than just discrete groups; it applies to locally compact groups, to such groups 
“with coefficients” (i.e., acting on a C*-algebra), and even to groupoids [113]. In 
its greatest generality the conjecture is known to be false [48], though a patch 
which might repair it has been proposed [13]. However, the original version of 
the conjecture is still open, though the literature on the conjecture has grown to 
more than 300 items. To avoid having to talk about Kasparov’s KK-theory, we will 
omit discussion of the conjecture with coefficients, and will just stick to the original 
conjecture for groups. 


8Tt is known that the natural map C* (zr) > C* (sr) is an isomorphism if and only if 7 is amenable. 
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Conjecture 4.1 (Baum-Connes Conjecture). Let G be a second countable 
locally compact group, and let C*(G) denote the completion of L(G) for its 
action by left convolution on L?(G). Then there is a natural assembly map 


wu: KE(@G) > K.(C(Q)), 


where &G is the universal proper G-space (a contractible space on which 
G acts properly), and this map is an isomorphism. If G has no nontrivial 
compact subgroups, then the assembly map simplifies to 


li) RBG) > Ks(CA(G)), 


Proposition 4.2. Conjecture 4.1 implies Conjecture 1.2. 


Proof. For this we take G = zr to be discrete and countable. For simplicity, we 
also work with the periodic L-theory spectra instead of the connective ones. (The 
difference only affects the bottom of the surgery sequence (1).) If 2 is torsion- 
free, the domain of yz is Ke(Bx) = H.(Bx;K?). But after inverting 2, K'°? is 
just a direct sum of two copies of L(Z), one of them shifted in degree by 2. So if 
Conjecture 4.1 holds for z and z is torsion-free, we have the commuting diagram 


Aq 


H.(Bx;L(Z)) @Q L.(Zz) ®Q 


H,(Bx;K"?) @Q —_> K.(Ci()) @Q. s 


Diagram (5) immediately implies that the rational L-theory assembly map A, [the 
same map as the map induced on rational homotopy groups by (3)] is injective. 

If z is not torsion-free, then & and Ex are not the same,’ but there is always a 
wt-equivariant map Ex — &z. Thus we need only replace (5) by the diagram 


°In the extreme case where z is a torsion group, & = pt, while if z is nontrivial, Ex is necessarily 
infinite dimensional. 
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Ar 


H, (Bx; L(Z)) ® Q —~ H? (Ex; L(Z)) ® Q ——> L. (Zz) @Q 


|- 


H,(Ba;K") @Q —* > H? (€n;K"?) @Q + > K.(Ci(z)) @Q. A 


Since points in yz have finite isotropy, and since the 7-map 2 —> z/o, 0 a finite 
subgroup of z, induces the map Z — R(c) on equivariant K-homology, a spectral 
sequence argument shows that the bottom left map a in (6) is injective, and so by a 
diagram chase, A, is injective. Oo 


Thus Conjecture 4.1 (for the case of discrete groups) implies Conjecture 1.2. 
However, Conjecture 4.1 for non-discrete groups is also quite interesting and 
important. There are two main reasons for this: 


1. There are “change of group methods” that enable one to pass from results for 
a group to results for a closed subgroup. Many of the significant early results 
on Novikov’s Conjecture were proved by considering discrete groups z that 
embed in a Lie group (or p-adic Lie group) and then using these change of group 
methods to pass from the Lie group to the discrete subgroup. 

2. The Baum-Connes Conjecture for connected Lie groups (also known as the 
Connes-Kasparov Conjecture) and the same conjecture for p-adic groups are both 
quite interesting in their own right, and say a lot about representation theory. For 
an introduction to this topic, see [11,47]. For some of the more significant results, 
see [14, 67, 110, 118]. For recent applications to harmonic analysis on reductive 
groups, see [3, 73, 90, 102]. 


Another direction arising out of both the controlled topology and the analytic 
approaches to Novikov’s Conjecture leads to the so-called coarse Baum-Connes 
Conjecture [49, 96, 120]. This conjecture deals with the large-scale geometry of 
metric spaces X of bounded geometry (think of complete Riemannian manifolds 
with curvature bounds, or of finitely generated groups with a word-length metric). 
Roughly speaking, the coarse Novikov Conjecture says that indices of generalized 
elliptic operators capture all of the coarse (i.e., “large-scale’’) rational homology of 
such a space X. 


Conjecture 4.3 (Coarse Baum-Connes and Novikov). Let X be a uni- 
formly contractible locally compact complete metric space of bounded 


(continued) 
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Conjecture 4.3 (continued) 

geometry, in which all metric balls are compact. Let KX.(X) be the coarse K- 
homology of X (the direct limit of the K-homologies of successively coarser 
Rips complexes) and let C*(X) be the C*-algebra of locally compact, finite 
propagation operators on X. Then Roe defined a natural assembly map 


[joo KO KC OO): (7) 


The coarse Baum-Connes Conjecture is that {4 is an isomorphism; the coarse 
Novikov Conjecture is that 1 is rationally injective. 


Positive results on Conjecture 4.3 may be found in [28, 29, 40, 42, 49, 96, 
114, 120]. 

However, it is known that the conjecture fails in various situations [32, 48, 121], 
especially if one drops the bounded geometry assumption. 

The coarse Baum-Connes conjecture implies the Novikov conjecture under mild 
conditions. To see this, suppose for example that there is a compact metrizable 
model Y for Bz, and let X = Em be its universal covering. Then there is a 
commutative diagram 


where a is usual Baum-Connes assembly, jz is as in Conjecture 4.3, ha denotes 
homotopy fixed points, and tr is a transfer map. Then jz being an isomorphism 
implies that jz!” is an isomorphism, and so we get a splitting for w. Refinements 
of this argument, as well as generalizations of the coarse Baum-Connes conjecture, 
may be found in [80]. 

Thinking of C*(z) as being (up to Morita equivalence) the same thing as the 
fixed points of 2 on C*(X) also gives rise to a nice way of relating the surgery 
exact sequence (2) to the Baum-Connes assembly map. This was accomplished in 
the series of papers [51—53, 89], which set up a natural transformation from the 
surgery sequence to a long exact sequence where the C*-algebraic assembly map 
corresponds to the L-theory assembly map in the original sequence. This gives an 
even more direct connection between coarse Baum-Connes and surgery theory. 

Other “new directions” from Novikov’s Conjecture arise from replacing the 
higher signature of Definition 1.1 with other sorts of “higher indices.” For example, 
an important case is obtained by replacing “(M) with A (M), the total A class. 
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This is again a certain polynomial in the rational Pontrjagin class, and has the 
property that when M is a spin manifold, A (M) lM [M] is the Chern character of 
the class [D] defined by the Dirac operator on M. (Here the reader doesn’t need 
to know much about the Dirac operator D except for the fact that it’s an elliptic 
first-order differential operator canonically defined on a Riemannian manifold with 
a spin structure.) It was pointed out by Lichnerowicz [68] that when M is closed and 
has positive scalar curvature, then the spectrum of D must be bounded away from 0, 
and thus ind(D) = (A (M), [M]) has to vanish. When M is not simply connected, a 
major strengthening of this is possible: 


Conjecture 4.4 (Gromov-Lawson Conjecture [46]). Let M be a connected 
closed spin Riemannian manifold of positive scalar curvature, let ma be a 
discrete group, and let f: M — Br be a continuous map (determined up 
to homotopy by a homomorphism m,\(M) — sz). Then the higher A-genus 
te (A (M)  [M]) € AH. (Bx, Q) vanishes. 


This conjecture is still open in general, but it is known to be closely related to 
Novikov’s Conjecture. For example, it was shown in [97] that Conjecture 4.4 is true 
whenever the K-theory assembly map K.(Bz) — K.(C*(z)) is rationally injective, 
and thus a fortiori whenever Conjecture 4.1 holds. It also can be deduced from 
certain cases of Conjecture 4.3, by a descent argument similar to the one above. 
The Lichnerowicz argument also applies to complete noncompact spin manifolds 
M of uniformly positive scalar curvature, and when Conjecture 4.3 holds, one gets 
obstructions to existence of such metrics living in K.(C*(X)) whenever there is a 
coarse map M —> X. 

Conjecture 4.4 can be refined to conjectures about necessary and sufficient 
conditions for positive scalar curvature. Here we just mention a few of several 
possible versions. For these it’s necessary to go beyond ordinary homology and 
to consider KO-homology, the homology theory dual to the (topological) K-theory 
of real vector bundles. This theory is 8-periodic and has coefficient groups KO; = Z 
when j is divisible by 4 (this part is detected by the Chern character to ordinary 
homology), Z/2 when j = 1,2 (mod 8), 0 otherwise. The class [D] of the Dirac 
operator on a spin manifold M lives in KO,(M), n = dim(M). While the actual 
operator D depends on a choice of a Riemannian metric, the class [D] € KO,(M) 
does not, so that the following conjecture makes sense. 
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Conjecture 4.5 (Gromov-Lawson-Rosenberg Conjecture). Let M be a 
connected closed spin manifold with fundamental group x and Dirac operator 
Dy, and let f: M — Bz be the classifying map for the universal cover. Let 
A: KO.(Bm) — KO.(C;(z)) be the assembly map in real K-theory. Then 
M admits a Riemannian metric of positive scalar curvature if and only if 
Ao fx([Du]) = 0 in KO, (C} (z)), n = dimM > 5. 


The restriction to n > 5 is needed only to use surgery methods to construct a 
metric of positive scalar curvature when the obstruction vanishes; it is not needed 
to show that there is a genuine obstruction to positive scalar curvature when 
Ao fx([Du]) 4 0, which was proven in [98]. For the next conjecture, we need to 
introduce a choice of Bott manifold, a geometric representative for Bott periodicity 
in KO-homology. This is a simply connected closed spin manifold Bt® of dimension 
8 with (f (Bt®), [Bt*]) = 1. It may be chosen to be Ricci flat. Since scalar curvature 
is additive on Riemannian products, Bt® being Ricci flat implies that taking a product 
with the Bott manifold does not change the scalar curvature. 


Conjecture 4.6 (Stable Gromov-Lawson-Rosenberg Conjecture). Let M 

be a connected closed spin manifold with fundamental group a and Dirac 

operator Dy, and let f: M — Bra be the classifying map for the universal 

cover. Let Bt® be a Bott manifold as above. Then M stably admits a Rieman- 
k 


——<—__ 7 
nian metric of positive scalar curvature, in the sense that M x Bt’ x --- x Bt 
admits such a metric for some k, if and only if A o fx([Dy]) = 0 in 
KO, (C} (zr)), n = dim M. 


There are simple implications 


Conj.4.5 = Conj.4.6, Conj.4.6 + injectivity of A > Conj. 4.4. 


The (very strong) Conjecture 4.5 is known to hold for especially nice groups, such as 
free abelian groups [98], hyperbolic groups of low dimension [55], and finite groups 
with periodic cohomology [18], but it fails in general [55, 108]. Conjecture 4.6 
is weaker, and holds for all the known counterexamples to Conjecture 4.5. It was 
formulated and proven for finite groups in [103]. Subsequently, Stolz [unpublished] 
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showed that it follows from the Baum-Connes Conjecture, Conjecture 4.1. For a 
survey on this entire field, see [100]. 

The last “new direction” we would like to discuss here comes from replacing the 
higher signature in Novikov’s Conjecture by the higher Todd genus or the higher 
elliptic genus. This seems to be quite relevant for understanding the interaction 
between topological invariants and algebraic geometry invariants for algebraic 
varieties defined over C. 

The Todd class .7(M) is still another polynomial in characteristic classes, this 
time the rational Chern classes of a complex (or almost complex) manifold. Suppose 
for simplicity that M is a smooth projective variety over C, viewed as a complex 
manifold via an embedding into some complex projective space. The Hirzebruch 
Riemann-Roch Theorem then says that 


(.7(M), [M]) = x(M, Ou) = >\(-1) dim H'(M, Oy), (8) 


j=0 


where @y is the structure sheaf of M, the sheaf of germs of holomorphic functions, 
and n is the complex dimension of M. The right-hand side of (8) is called the 
arithmetic genus. (The original definition of the latter by algebraic geometers like 
Severi turned out to be (—1)"(y(M, Gy) — 1), but the normalization here is a bit 
more convenient.) The left-hand side of (8) is called the Todd genus, and is known 
to be a birational invariant.'° Once again, if one has a map f: M — Bz, then we 
can define the associated higher Todd genus as fx(Y (M) MN [M]) € H.(Bx, Q). 


Conjecture 4.7 (Algebraic Geometry Novikov Conjecture [101]). Let M 
be a smooth complex projective variety, and let f: M — Bz be a continuous 


9 
map (for the topology of M as a complex manifold). Let M' «> M_ be 
a birational map. Then the corresponding higher Todd genera agree, i.e., 


fe(-F(M) 9 [M]) = (f 0 g)x(F(M) 9 [M’']) € He (Br, Q). 


Note the obvious similarity with Conjecture 1.2. However, unlike Novikov’s 
original conjecture, this statement is actually a theorem [15, 19]. That follows from 


‘ORecall that two varieties are said to be birationally equivalent if there are rational maps between 
them which are inverses of each. Since rational maps do not have to be everywhere defined (this 
is why we denote rational maps below by dotted lines), two varieties are birationally equivalent if 
and only if they have Zariski-open subsets which are isomorphic as varieties. 
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9 
the fact that if M’ => M_ is a birational map, then g.([Dw]) = [Dy] € 


Ko(M), where [Dy] denotes the K-homology class of the Dolbeault operator, 
whose Chern character is 7(M) M [M].'! The corresponding statement for the 
signature operator is not true; a homotopy equivalence does not have to preserve 
the class of the signature operator. (However, the mod 8 reduction of this class is 
preserved [106].) 

However, there is another similarity with Novikov’s Conjecture which is pointed 
out in [101]. By [112, Théoréme IV.17], £2., the graded ring of cobordism classes of 
oriented manifolds, is, after tensoring with Q, a polynomial ring in the classes of the 
complex projective spaces CP“, k € N. Then if /, is the ideal in 2, generated by all 
[M] — [M’] with M and M’ homotopy equivalent (in a way preserving orientation), 
Kahn [56] proved that 2.//. = Q, with the quotient map identified with the 
Hirzebruch signature. Similarly, QU, the graded ring of cobordism classes of almost 
complex manifolds, is, after tensoring with Q, a polynomial ring in the classes of 
all complex projective spaces, and the quotient of QU by the ideal generated by all 
[M|—[M’] with M and M’ birationally equivalent smooth projective varieties is again 
Q, this time with the quotient map identifiable with the Todd genus. 

These results effectively say that, up to multiples, the signature is the only 
homotopy-invariant genus on oriented manifolds, and the arithmetic genus is 
the only birationally invariant genus on smooth projective varieties. But if one 
considers manifolds with large fundamental group, the situation changes. By [101, 
Theorem 4.1], a linear functional on 2,(Bz) ® Q that is an oriented homotopy 
invariant must come from the higher signature, and by [101, Theorem 4.3], a linear 
functional on QY(Bz) @ Q that is a birational invariant must (under a certain 
technical condition satisfied in many cases) come from the higher Todd genus. 

Finally, the papers [17, 24, 44] consider still more analogues of higher genera 
with the Todd genus replaced by the elliptic genus. The result of [17] is particularly 
nice; it is the exact analogue of Conjecture 4.7, but with the Todd genus replaced 
by the elliptic genus and with birational equivalence replaced by K-equivalence 
(a birational equivalence preserving canonical bundles). 
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It takes a bit of work to make sense of gx here, since y may not be everywhere defined, but this 
can be done. The point is that by the factorization theorem for birational maps [1], we can factor @ 
into a sequence of blow-ups and blow-downs, and x is clearly defined for a blow-down (since it 
is a continuous map) and is an isomorphism in this case by the Baum-Fulton-MacPherson variant 
of Grothendieck-Riemann-Roch [12]. In the case of a blow-up, let gy» be given by the inverse of 
the map induced by the reverse blow-down. 


Novikov’s Conjecture 397 


References 


1, 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


21, 


Abramovich, D., Karu, K., Matsuki, K., Witodarczyk, J.: Torification and factorization of 
birational maps. J. Amer. Math. Soc. 15(3), 531-572 (electronic) (2002). DOI 10.1090/ 
S$0894-0347-02-00396-X 


. Atiyah, M.F., Singer, I.M.: The index of elliptic operators. III. Ann. of Math. (2) 87, 546-604 
(1968) 
. Aubert, A.M., Baum, P., Plymen, R., Solleveld, M.: On the local Langlands correspondence 


for non-tempered representations. Miinster J. Math. 7, 27-50 (2014) 


. Bartels, A., Farrell, F.T., Liick, W.: The Farrell-Jones conjecture for cocompact lattices in 


virtually connected Lie groups. J. Amer. Math. Soc. 27(2), 339-388 (2014). DOI 10.1090/ 
S0894-0347-2014-00782-7 


. Bartels, A., Liick, W.: Isomorphism conjecture for homotopy K-theory and groups acting on 


trees. J. Pure Appl. Algebra 205(3), 660-696 (2006). DOI 10.1016/j.jpaa.2005.07.020 


. Bartels, A., Liick, W., Reich, H.: The K-theoretic Farrell-Jones conjecture for hyperbolic 


groups. Invent. Math. 172(1), 29-70 (2008). DOI 10.1007/s00222-007-0093-7 


. Bartels, A., Reich, H.: On the Farrell-Jones conjecture for higher algebraic K-theory. J. Amer. 


Math. Soc. 18(3), 501-545 (2005). DOI 10.1090/S0894-0347-05-00482-0 


. Bass, H., Murthy, M.P.: Grothendieck groups and Picard groups of abelian group rings. Ann. 


of Math. (2) 86, 16-73 (1967) 


. Baum, P., Connes, A.: Chern character for discrete groups. In: A féte of topology, 


pp. 163-232. Academic Press, Boston, MA (1988). DOI 10.1016/B978-0-12-480440- 1. 
50015-0 

Baum, P., Connes, A.: Geometric K-theory for Lie groups and foliations. Enseign. Math. (2) 
46(1-2), 3-42 (2000) 

Baum, P., Connes, A., Higson, N.: Classifying space for proper actions and K-theory of 
group C*-algebras. In: C*-algebras: 1943-1993 (San Antonio, TX, 1993), Contemp. Math., 
vol. 167, pp. 240-291. Amer. Math. Soc., Providence, RI (1994). DOI 10.1090/conm/167/ 
1292018 

Baum, P., Fulton, W., MacPherson, R.: Riemann-Roch and topological K theory for singular 
varieties. Acta Math. 143(3-4), 155-192 (1979). DOI 10.1007/BF02392091 

Baum, P., Guentner, E., Willett, R.: Expanders, exact crossed products, and the Baum-Connes 
conjecture. Ann. of K-Theory, 1(2), 155-208 (2016). URL http://arxiv.org/abs/1311.2343. 
Baum, P., Higson, N., Plymen, R.: A proof of the Baum-Connes conjecture for p-adic GL(n). 
C. R. Acad. Sci. Paris Sér. I Math. 325(2), 171-176 (1997). DOI 10.1016/S0764-4442(97) 
84594-6 

Block, J., Weinberger, S.: Higher Todd classes and holomorphic group actions. Pure Appl. 
Math. Q. 2(4, Special Issue: In honor of Robert D. MacPherson. Part 2), 1237-1253 (2006). 
DOI 10.4310/PAMQ.2006.v2.n4.a13 

Bokstedt, M., Hsiang, W.C., Madsen, I.: The cyclotomic trace and algebraic K-theory of 
spaces. Invent. Math. 111(3), 465-539 (1993). DOI 10.1007/BF01231296 

Borisov, L., Libgober, A.: Higher elliptic genera. Math. Res. Lett. 15(3), 511-520 (2008). 
DOI 10.4310/MRL.2008.v15.n3.a10 

Botvinnik, B., Gilkey, P., Stolz, S.: The Gromov-Lawson-Rosenberg conjecture for groups 
with periodic cohomology. J. Differential Geom. 46(3), 374-405 (1997). URL http:// 
projecteuclid.org/euclid.jdg/1214459973 

Brasselet, J.P., Schiirmann, J., Yokura, S.: Hirzebruch classes and motivic Chern classes for 
singular spaces. J. Topol. Anal. 2(1), 1-55 (2010). DOT 10.1142/S 17935253 10000239 
Browder, W.: Poincaré spaces, their normal fibrations and surgery. Invent. Math. 17, 191-202 
(1972) 

Browder, W.: Surgery on simply-connected manifolds. Springer-Verlag, New York- 
Heidelberg (1972). Ergebnisse der Mathematik und ihrer Grenzgebiete, Band 65 


398 


22 


23. 


24. 


2% 


26. 


27. 


28. 


29. 


30. 


31. 


32. 


33. 


34. 


35; 


36. 


37. 


38. 


39. 


40. 


41. 


42. 


J. Rosenberg 


. Browder, W.: Differential topology of higher-dimensional manifolds. In: Surveys on surgery 
theory, Vol. 1, Ann. of Math. Stud., vol. 145, pp. 41-71. Princeton Univ. Press, Princeton, NJ 
(2000) 

Cappell, S.E.: On homotopy invariance of higher signatures. Invent. Math. 33(2), 171-179 
(1976) 

Cappell, S.E., Libgober, A., Maxim, L.G., Shaneson, J.L.: Hodge genera of algebraic 
varieties. II. Math. Ann. 345(4), 925-972 (2009). DOI 10.1007/s00208-009-0389-6 
Carlsson, G.: Bounded K-theory and the assembly map in algebraic K-theory. In: Novikov 
conjectures, index theorems and rigidity, Vol. 2 (Oberwolfach, 1993), London Math. Soc. 
Lecture Note Ser., vol. 227, pp. 5-127. Cambridge Univ. Press, Cambridge (1995). DOI 
10.1017/CBO97805 1 1629365.004 

Carlsson, G., Pedersen, E.K.: Controlled algebra and the Novikov conjectures for K- and 
L-theory. Topology 34(3), 731-758 (1995). DOI 10.1016/0040-9383(94)00033-H 

Chang, S., Weinberger, S.: On invariants of Hirzebruch and Cheeger-Gromov. Geom. Topol. 
7, 311-319 (electronic) (2003). DOI 10.2140/gt.2003.7.311 

Chen, X., Wang, Q., Yu, G.: The maximal coarse Baum-Connes conjecture for spaces which 
admit a fibred coarse embedding into Hilbert space. Adv. Math. 249, 88-130 (2013). DOI 
10.1016/j.aim.2013.09.003 

Chen, X., Wang, Q., Yu, G.: The coarse Novikov conjecture and Banach spaces with Property 
(H). J. Funct. Anal. 268(9), 2754-2786 (2015). DOI 10.1016/j.jfa.2015.02.001 

Cortifias, G., Tartaglia, G.: Trace class operators, regulators, and assembly maps in K-theory. 
Doc. Math. 19, 439-456 (2014). URL https://www.math.uni-bielefeld.de/documenta/vol- 19/ 
14.pdf 

Davis, J.F.: Manifold aspects of the Novikov conjecture. In: Surveys on surgery theory, Vol. 1, 
Ann. of Math. Stud., vol. 145, pp. 195-224. Princeton Univ. Press, Princeton, NJ (2000) 
Dranishnikov, A.N., Ferry, S.C., Weinberger, S.: Large Riemannian manifolds which are 
flexible. Ann. of Math. (2) 157(3), 919-938 (2003). DOI 10.4007/annals.2003.157.919 
Farrell, F.T., Hsiang, W.C.: Manifolds with 2; = GX, T. Amer. J. Math. 95, 813-848 (1973) 
Farrell, F.T., Jones, L.E.: Negatively curved manifolds with exotic smooth structures. J. Amer. 
Math. Soc. 2(4), 899-908 (1989). DOI 10.2307/1990898 

Farrell, F.T., Jones, L.E.: Exotic smoothings of hyperbolic manifolds which do not support 
pinched negative curvature. Proc. Amer. Math. Soc. 121(2), 627-630 (1994). DOI 10.2307/ 
2160446 

Ferry, S., Rosenberg, J., Weinberger, S.: Phénoménes de rigidité topologique équivariante. 
C.R. Acad. Sci. Paris Sér. I Math. 306(19), 777-782 (1988) 

Ferry, S.C., Ranicki, A., Rosenberg, J.: A history and survey of the Novikov conjecture. In: 
Novikov conjectures, index theorems and rigidity, Vol. 1 (Oberwolfach, 1993), London Math. 
Soc. Lecture Note Ser., vol. 226, pp. 7-66. Cambridge Univ. Press, Cambridge (1995). DOI 
10.1017/CBO97805 1 1662676.003 

Ferry, S.C., Ranicki, A., Rosenberg, J. (eds.): Novikov conjectures, index theorems and 
rigidity. Vol. 1, London Mathematical Society Lecture Note Series, vol. 226. Cambridge 
University Press, Cambridge (1995). Including papers from the conference held at the 
Mathematisches Forschungsinstitut Oberwolfach, Oberwolfach, September 6-10, 1993 
Ferry, S.C., Ranicki, A., Rosenberg, J. (eds.): Novikov conjectures, index theorems and 
rigidity. Vol. 2, London Mathematical Society Lecture Note Series, vol. 227. Cambridge 
University Press, Cambridge (1995). Including papers from the conference held at the 
Mathematisches Forschungsinstitut Oberwolfach, Oberwolfach, September 6—10, 1993 
Finn-Sell, M.: Fibred coarse embeddings, a-T-menability and the coarse analogue of the 
Novikov conjecture. J. Funct. Anal. 267(10), 3758-3782 (2014). DOI 10.1016/j.jfa.2014. 
09.012 

Freedman, M.H.: The topology of four-dimensional manifolds. J. Differential Geom. 17(3), 
357-453 (1982). URL http://projecteuclid.org/euclid.jdg/1214437136 

Fukaya, T., Oguni, S.i.: Coronae of product spaces and the coarse Baum—Connes conjecture. 
Ady. Math. 279, 201-233 (2015). DOI 10.1016/j.aim.2015.01.022 


Novikov’s Conjecture 399 


43 


44. 


45. 


46. 


47. 


48. 


49. 


50. 


51. 


52. 


53. 


54. 


55: 


56. 


57. 


58. 


59. 


60. 


61. 


62. 


63. 


64. 


. Gong, D.: Equivariant Novikov conjecture for groups acting on Euclidean buildings. Trans. 
Amer. Math. Soc. 350(6), 2141-2183 (1998). DOI 10.1090/S0002-9947-98-01990-4 

Gong, D., Liu, K.: Rigidity of higher elliptic genera. Ann. Global Anal. Geom. 14(3), 
219-236 (1996). DOI 10.1007/BF00054471 

Gromoy, M.: Positive curvature, macroscopic dimension, spectral gaps and higher signatures. 
In: Functional analysis on the eve of the 21st century, Vol. If (New Brunswick, NJ, 1993), 
Progr. Math., vol. 132, pp. 1-213. Birkhauser Boston, Boston, MA (1996). DOI 10.1007/ 
s10107-010-0354-x 

Gromoy, M., Lawson Jr., H.B.: Positive scalar curvature and the Dirac operator on complete 
Riemannian manifolds. Inst. Hautes Etudes Sci. Publ. Math. 58, 83-196 (1984) (1983). URL 
http://www.numdam.org/item?id=PMIHES_1983__58__83_0 

Higson, N.: The Baum-Connes conjecture. In: Proceedings of the International Congress of 
Mathematicians, Vol. II (Berlin, 1998), Extra Vol. II, pp. 637-646 (electronic) (1998). URL 
https://www.math.uni-bielefeld.de/documenta/xvol-icm/08/08.html 

Higson, N., Lafforgue, V., Skandalis, G.: Counterexamples to the Baum-Connes conjecture. 
Geom. Funct. Anal. 12(2), 330-354 (2002). DOI 10.1007/s00039-002- 8249-5 

Higson, N., Roe, J.: On the coarse Baum-Connes conjecture. In: Novikov conjectures, 
index theorems and rigidity, Vol. 2 (Oberwolfach, 1993), London Math. Soc. Lecture Note 
Ser, vol. 227, pp. 227-254. Cambridge Univ. Press, Cambridge (1995). DOI 10.1017/ 
CBO9780511629365.008 

Higson, N., Roe, J.: Analytic K-homology. Oxford Mathematical Monographs. Oxford 
University Press, Oxford (2000). Oxford Science Publications 

Higson, N., Roe, J.: Mapping surgery to analysis. I. Analytic signatures. K-Theory 33(4), 
277-299 (2005). DOI 10.1007/s10977-005- 1561-8 

Higson, N., Roe, J.: Mapping surgery to analysis. II. Geometric signatures. K-Theory 33(4), 
301-324 (2005). DOI 10.1007/s10977-005- 1559-2 

Higson, N., Roe, J.: Mapping surgery to analysis. II]. Exact sequences. K-Theory 33(4), 
325-346 (2005). DOI 10.1007/s10977-005- 1554-7 

Hirzebruch, F.: The signature theorem: reminiscences and recreation. In: Prospects in 
mathematics (Proc. Sympos., Princeton Univ., Princeton, N.J., 1970), pp. 3-31. Ann. of Math. 
Studies, No. 70. Princeton Univ. Press, Princeton, N.J. (1971) 

Joachim, M., Schick, T.: Positive and negative results concerning the Gromov-Lawson- 
Rosenberg conjecture. In: Geometry and topology: Aarhus (1998), Contemp. Math., vol. 258, 
pp. 213-226. Amer. Math. Soc., Providence, RI (2000). DOI 10.1090/conm/258/04066 
Kahn, P.J.: Characteristic numbers and oriented homotopy type. Topology 3, 81-95 (1965) 
Kasparov, G.: Novikov’s conjecture on higher signatures: the operator K-theory approach. In: 
Representation theory of groups and algebras, Contemp. Math., vol. 145, pp. 79-99. Amer. 
Math. Soc., Providence, RI (1993). DOI 10.1090/conm/145/1216182 

Kasparov, G.G.: The homotopy invariance of rational Pontrjagin numbers. Dokl. Akad. Nauk 
SSSR 190, 1022-1025 (1970) 

Kasparov, G.G.: Topological invariants of elliptic operators. I. K-homology. Izv. Akad. Nauk 
SSSR Ser. Mat. 39(4), 796-838 (1975) 

Kasparov, G.G.: The operator K-functor and extensions of C*-algebras. Izv. Akad. Nauk 
SSSR Ser. Mat. 44(3), 571-636, 719 (1980) 

Kasparov, G.G.: Equivariant KK-theory and the Novikov conjecture. Invent. Math. 91(1), 
147-201 (1988). DOI 10.1007/BF01404917 

Kasparov, G.G.: K-theory, group C*-algebras, and higher signatures (conspectus). In: 
Novikov conjectures, index theorems and rigidity, Vol. 1 (Oberwolfach, 1993), London Math. 
Soc. Lecture Note Ser., vol. 226, pp. 101-146. Cambridge Univ. Press, Cambridge (1995). 
DOI 10.1017/CBO97805 1 1662676.007 

Kasprowski, D.: On the K-theory of groups with finite decomposition complexity. Proc. Lond. 
Math. Soc. (3) 110(3), 565-592 (2015). DOI 10.1112/plms/pdu062 

Kervaire, M.A.: Le théoréme de Barden-Mazur-Stallings. Comment. Math. Helv. 40, 31-42 
(1965) 


400 


65 


66. 


67. 


68. 
69. 


70. 


71. 


72. 


73. 


TA. 


73; 


76. 


77 


2. 


79. 


80. 


81. 


82. 


83. 


84. 


85. 


86. 


87. 


J. Rosenberg 


. Kervaire, M.A., Milnor, J.W.: Groups of homotopy spheres. I. Ann. of Math. (2) 77, 504-537 
(1963) 

Kreck, M., Liick, W.: The Novikov conjecture, Geometry and algebra, Oberwolfach Seminars, 
vol. 33. Birkhauser Verlag, Basel (2005). 

Lafforgue, V.: K-théorie bivariante pour les algébres de Banach et conjecture de Baum- 
Connes. Invent. Math. 149(1), 1-95 (2002). DOI 10.1007/s002220200213 

Lichnerowicz, A.: Spineurs harmoniques. C. R. Acad. Sci. Paris 257, 7-9 (1963) 

Loday, J.L.: K-théorie algébrique et représentations de groupes. Ann. Sci. Ecole Norm. Sup. 
(4) 9(3), 309-377 (1976) 

Liick, W.: K- and L-theory of group rings. In: Proceedings of the International Congress of 
Mathematicians. Volume II, pp. 1071-1098. Hindustan Book Agency, New Delhi (2010) 
Liick, W., Reich, H.: The Baum-Connes and the Farrell-Jones conjectures in K- and L-theory. 
In: Handbook of K-theory. Vol. 1, 2, pp. 703-842. Springer, Berlin (2005). DOI 10.1007/ 
978-3-540-27855-9_15. URL http://k-theory.org/handbook/2-0703-0842.pdf 

Lusztig, G.: Novikov’s higher signature and families of elliptic operators. J. Differential 
Geometry 7, 229-256 (1972) 

Mendes, S., Plymen, R.: Base change and K-theory for GL(n). J. Noncommut. Geom. 1(3), 
311-331 (2007). DOI 10.4171/JNCG/9 

Milnor, J.: On manifolds homeomorphic to the 7-sphere. Ann. of Math. (2) 64, 399-405 
(1956) 

Milnor, J.: A procedure for killing homotopy groups of differentiable manifolds. In: Proc. 
Sympos. Pure Math., Vol. III, pp. 39-55. American Mathematical Society, Providence, R.I 
(1961) 

Milnor, J.: Lectures on the h-cobordism theorem. Notes by L. Siebenmann and J. Sondow. 
Princeton University Press, Princeton, N.J. (1965) 

. Milnor, J.: Whitehead torsion. Bull. Amer. Math. Soc. 72, 358-426 (1966) 

Mishchenko, A.S.: Infinite-dimensional representations of discrete groups, and higher signa- 
tures. Izv. Akad. Nauk SSSR Ser. Mat. 38, 81-106 (1974) 

Mishchenko, A.S.: C*-algebras and K-theory. In: Algebraic topology, Aarhus 1978 (Proc. 
Sympos., Univ. Aarhus, Aarhus, 1978), Lecture Notes in Math., vol. 763, pp. 262-274. 
Springer, Berlin (1979) 

Mitchener, P.D.: The general notion of descent in coarse geometry. Algebr. Geom. Topol. 
10(4), 2419-2450 (2010). DOI 10.2140/agt.2010.10.2419. URL http://msp.org/agt/2010/10- 
4/p18.xhtml 

Novikov, S.P.: A diffeomorphism of simply connected manifolds. Dokl. Akad. Nauk SSSR 
143, 1046-1049 (1962) 

Novikov, S.P.: Homotopically equivalent smooth manifolds. I. Izv. Akad. Nauk SSSR Ser. 
Mat. 28, 365-474 (1964) 

Novikov, S.P.: Rational Pontrjagin classes. Homeomorphism and homotopy type of closed 
manifolds. I. Izv. Akad. Nauk SSSR Ser. Mat. 29, 1373-1388 (1965) 

Novikov, S.P.: Topological invariance of rational classes of Pontrjagin. Dokl. Akad. Nauk 
SSSR 163, 298-300 (1965) 

Novikov, S.P.: Pontrjagin classes, the fundamental group and some problems of stable algebra. 
In: Amer. Math. Soc. Translations Ser. 2, Vol. 70: 31 Invited Addresses (8 in Abstract) at the 
Internat. Congr. Math. (Moscow, 1966), pp. 172-179. Amer. Math. Soc., Providence, R.I. 
(1968) 

Novikov, S.P.: Algebraic construction and properties of Hermitian analogs of K-theory 
over rings with involution from the viewpoint of Hamiltonian formalism. Applications to 
differential topology and the theory of characteristic classes. I. I. Izv. Akad. Nauk SSSR Ser. 
Mat. 34, 253-288; ibid. 34 (1970), 475-500 (1970) 

Novikov, S.P.: Pontrjagin classes, the fundamental group and some problems of stable 
algebra. In: Essays on Topology and Related Topics (Mémoires dédiés 4 Georges de Rham), 
pp. 147-155. Springer, New York (1970) 


Novikov’s Conjecture 401 


88 


89. 


90. 


91. 


92. 


93. 


94. 


95. 


96. 


97. 


98. 


99. 


100. 


101. 


102. 


103. 


104. 


105. 


106. 


107. 


108. 


. Novikov, S.P.: Analogues hermitiens de la K-théorie. In: Actes du Congrés International des 
Mathématiciens (Nice, 1970), Tome 2, pp. 39-45. Gauthier-Villars, Paris (1971) 

Piazza, P., Schick, T.: The surgery exact sequence, K-theory and the signature operator. Ann. 
of K-Theory, 1(2), 109-154 (2016). URL http://arxiv.org/abs/1309.4370, DOI 10.2140/akt. 
2016.1.109. 

Plymen, R.J.: Reduced C*-algebra of the p-adic group GL(n). II. J. Funct. Anal. 196(1), 
119-134 (2002). DOI 10.1006/jfan.2002.3980 

Quinn, F.: A geometric formulation of surgery. In: Topology of Manifolds (Proc. Inst., Univ. 
of Georgia, Athens, Ga., 1969), pp. 500-511. Markham, Chicago, Ill. (1970) 

Ramras, D.A., Tessera, R., Yu, G.: Finite decomposition complexity and the integral Novikov 
conjecture for higher algebraic K-theory. J. Reine Angew. Math. 694, 129-178 (2014). DOI 
10.1515/crelle-2012-0112 

Ranicki, A.: The total surgery obstruction. In: Algebraic topology, Aarhus 1978 (Proc. 
Sympos., Univ. Aarhus, Aarhus, 1978), Lecture Notes in Math., vol. 763, pp. 275-316. 
Springer, Berlin (1979) 

Ranicki, A.: Algebraic and geometric surgery. Oxford Mathematical Monographs. The 
Clarendon Press, Oxford University Press, Oxford (2002). DOI 10.1093/acprof:oso/ 
9780198509240.001.0001. Oxford Science Publications 

Ranicki, A.A.: Algebraic L-theory and topological manifolds, Cambridge Tracts in Mathe- 
matics, vol. 102. Cambridge University Press, Cambridge (1992) 

Roe, J.: Coarse cohomology and index theory on complete Riemannian manifolds. Mem. 
Amer. Math. Soc. 104(497), x+90 (1993). DOI 10.1090/memo/0497 

Rosenberg, J.: C*-algebras, positive scalar curvature, and the Novikov conjecture. Inst. 
Hautes Etudes Sci. Publ. Math. 58, 197-212 (1984) (1983). URL http://www.numdam.org/ 
item?id=PMIHES_1983__58__197_0 

Rosenberg, J.: C*-algebras, positive scalar curvature, and the Novikov conjecture. III. 
Topology 25(3), 319-336 (1986). DOI 10.1016/0040-9383(86)90047-9 

Rosenberg, J.: Analytic Novikov for topologists. In: Novikov conjectures, index theorems 
and rigidity, Vol. 1 (Oberwolfach, 1993), London Math. Soc. Lecture Note Ser., vol. 226, 
pp. 338-372. Cambridge Univ. Press, Cambridge (1995). DOI 10.1017/CBO97805 11662676. 
013 

Rosenberg, J.: Manifolds of positive scalar curvature: a progress report. In: Surveys in 
differential geometry. Vol. XI, Surv. Differ. Geom., vol. 11, pp. 259-294. Int. Press, 
Somerville, MA (2007). DOI 10.4310/SDG.2006.v11.n1.a9 

Rosenberg, J.: An analogue of the Novikov conjecture in complex algebraic geometry. Trans. 
Amer. Math. Soc. 360(1), 383-394 (2008). DOI 10.1090/S0002-9947-07-04320-6 
Rosenberg, J.: Structure and applications of real C*-algebras. In: R.S. Doran, E. Park (eds.) 
Operator Algebras and Their Applications: A Tribute to Richard V. Kadison. Contemporary 
Math., Amer. Math. Soc., 2016. URL http://arxiv.org/abs/1505.04091 

Rosenberg, J., Stolz, S.: A “stable” version of the Gromov-Lawson conjecture. In: The Cech 
centennial (Boston, MA, 1993), Contemp. Math., vol. 181, pp. 405-418. Amer. Math. Soc., 
Providence, RI (1995). DOI 10.1090/conm/18 1/02046 

Rosenberg, J., Weinberger, S.: Higher G-indices and applications. Ann. Sci. Ecole Norm. 
Sup. (4) 21(4), 479-495 (1988). URL http://www.numdam.org/item?id=ASENS_1988_4_ 
21_4_479_0 

Rosenberg, J., Weinberger, S.: An equivariant Novikov conjecture. K-Theory 4(1), 29-53 
(1990). DOI 10.1007/BF00534192. With an appendix by J. P. May 

Rosenberg, J., Weinberger, S.: The signature operator at 2. Topology 45(1), 47-63 (2006). 
DOI 10.1016/j.top.2005.06.001 

Roushon, S.K.: The isomorphism conjecture in L-theory: graphs of groups. Homology 
Homotopy Appl. 14(1), 1-17 (2012). DOI 10.4310/HHA.2012.v14.n1.al 

Schick, T.: A counterexample to the (unstable) Gromov-Lawson-Rosenberg conjecture. 
Topology 37(6), 1165-1168 (1998). DOI 10.1016/S0040-9383(97)00082-7 


402 


109. 


110. 


111. 


112. 


113. 


114. 


115. 


116. 


117. 


118. 


119. 


120. 


121. 


J. Rosenberg 


Smale, S.: Generalized Poincaré’s conjecture in dimensions greater than four. Ann. of Math. 
(2) 74, 391-406 (1961) 

Solleveld, M.: Periodic cyclic homology of reductive p-adic groups. J. Noncommut. Geom. 
3(4), 501-558 (2009). DOI 10.4171/JNCG/45 

Sullivan, D.P.: Triangulating and smoothing homotopy equivalences and homeomorphisms. 
Geometric Topology Seminar Notes. In: The Hauptvermutung book, K-Monogr. Math., vol. 1, 
pp. 69-103. Kluwer Acad. Publ., Dordrecht (1996). DOI 10.1007/978-94-017-3343-4_3 
Thom, R.: Quelques propriétés globales des variétés différentiables. Comment. Math. Helv. 
28, 17-86 (1954). URL http://retro.seals.ch/digbib/view ?rid=comahe-002:1954:28::8 

Tu, J.L.: The Baum-Connes conjecture for groupoids. In: C*-algebras (Miinster, 1999), 
pp. 227-242. Springer, Berlin (2000) www.e-periodica.ch/digbib/view?pid=com-001:1954: 
28::8 

Tu, J.L.: The coarse Baum-Connes conjecture and groupoids. I. New York J. Math. 18, 1-27 
(2012). URL http://nyjm.albany.edu:8000/j/2012/18_1.html 

Wall, C.T.C.: Surgery of non-simply-connected manifolds. Ann. of Math. (2) 84, 217-276 
(1966) 

Wall, C.T.C.: Surgery on compact manifolds, Mathematical Surveys and Monographs, vol. 69, 
second edn. American Mathematical Society, Providence, RI (1999). DOI 10.1090/surv/069. 
Edited and with a foreword by A. A. Ranicki 

Wallace, A.H.: Modifications and cobounding manifolds. Canad. J. Math. 12, 503-528 (1960) 
Wassermann, A.: Une démonstration de la conjecture de Connes-Kasparov pour les groupes 
de Lie linéaires connexes réductifs. C. R. Acad. Sci. Paris Sér. I Math. 304(18), 559-562 
(1987) 

Weinberger, S.: The topological classification of stratified spaces. Chicago Lectures in 
Mathematics. University of Chicago Press, Chicago, IL (1994) 

Yu, G.: Coarse Baum-Connes conjecture. K-Theory 9(3), 199-221 (1995). DOI 10.1007/ 
BF00961664 

Yu, G.: The coarse Baum-Connes conjecture for spaces which admit a uniform embedding 
into Hilbert space. Invent. Math. 139(1), 201-240 (2000). DOI 10.1007/s002229900032 


The Discrete Logarithm Problem 


René Schoof 


Abstract For large prime numbers p, computing discrete logarithms of elements 
of the multiplicative group (Z/pZ)* is at present a very difficult problem. The 
security of certain cryptosystems is based on the difficulty of this computation. In 
this expository paper we discuss several generalizations of the discrete logarithm 
problem and we describe various algorithms to compute discrete logarithms. 


1 Introduction 


For a prime number p, the multiplicative group (Z/pZ)* is cyclic of order p — 1. 
Generators of (Z/pZ)* are called primitive roots mod p. Let p be a prime and let g 
denote a primitive root modulo p. Then for every x € (Z/pZ)* we have 


a 
x= 8, 


for some integer a. This integer is called the discrete logarithm of x and is denoted 
by logx. Since it depends on the primitive root g, one often writes log, x rather 
than log x. Since the discrete logarithm is only unique modulo p — 1, we view it as 
an element of the additive group Z/(p — 1)Z. Just as for the usual logarithm, we 
have for x, y € (Z/pZ)* that 


logxy = logx-+ logy. 


Here is an explicit example. Let p = 10000000259. Since (p — 1)/2 is prime, it is 
easy to see that g = 2 is a primitive root modulo p. We have 


log 3 = 9635867242, 
log5 = 227891530, 
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log 7 = 1803320787, 


illustrating the fact that there is no simple minded formula for log x in terms of x. 
Indeed, the discrete logarithms of the first few primes appear like random numbers 
in the interval from 0 to p — 1. 

Given a prime number p, a primitive root g € (Z/pZ)* and an exponent a 
in Z/(p—1)Z, the element x = g“ can be computed efficiently by repeated squarings 
and multiplications. On the other hand, given a large prime number p and a primitive 
root g € (Z/pZ)*, there are at present no good methods to compute the discrete 
logarithm of a given element x € (Z/pZ)*. In other words, computing the exponent 
a € Z/(p—1)Z for which x = g“, is a very difficult problem. In particular, there is no 
polynomial time algorithm known to perform this calculation. There do, however, 
exist subexponential algorithms. 

Designing good algorithms to compute discrete logarithms is a problem that is of 
interest in itself. It is also relevant for applications in cryptography. The security 
of the Diffie-Hellmann key exchange protocol [6] relies on the assumption that 
computing discrete logarithms is very hard. More precisely, the working hypothesis 
of this protocol is that given a prime number p, a primitive root g modulo p and 
elements x and y in (Z/pZ)*, but not their discrete logarithms a and J, it is very 
hard to compute the element g””. 

In this expository paper we describe some methods for computing discrete 
logarithms. We do not pretend to present the best known ones, but merely try to give 
the main ideas behind the most commonly used algorithms. In Sect. 2 we describe 
a natural generalization of the discrete logarithm problem and we discuss the baby- 
step-giant-step method due to Dan Shanks and a probabilistic method due to John 
Pollard. Section 3 is dedicated to the subexponential index calculus algorithm. 
In Sect.4 we discuss the discrete logarithm problem for multiplicative groups of 
finite fields of relatively small characteristic. In particular, we describe recent work 
of Antoine Joux and others. Finally in Sect.5, we discuss the discrete logarithm 
problem for groups of points of elliptic curves over finite fields. This is relevant for 
applications in cryptography. 

I thank Hendrik Lenstra for several useful comments on an earlier version of this 
paper. 


2 Exponential Algorithms 
Following ideas in [11, Sect. 7], we consider the following general problem. It is a 


natural extension of the discrete logarithm problem. 


Problem. Given a finite set S, a finite abelian group A and a group homomorphism 
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determine the kernel of f. 


In applications, the group A is the multiplicative group of a finite field or the 
multiplicative group (Z/nZ)* of the finite ring Z/nZ. It can also be an ideal class 
group of a number field, or the group of points of an elliptic curve or of an abelian 
variety over a finite field. In general, we assume that A is given to us in such a way 
that we can efficiently compose elements, calculate inverses and test for equality. 
Usually we even require that every element in A has a unique, easily computable 
“reduced” representative. But in general we do not suppose that we know the 
structure or even the cardinality of A. In most applications we know an upper bound 
for #A. Since Hom(Z5, A) is naturally isomorphic to A‘, the map f can be specified 
by giving #5 elements in A indexed by s € S. The kernel of f is a free group of the 
same rank as Z°. It can be described by giving #S generators. 

If S consists of one element, then f is determined by f(1) = x € A. Determining 
the kernel of f is the same problem as determining the order of the element x € A. 
An algorithm that solves this problem for the group A = (Z/nZ)* can be used to 
factor the integer n. See [13, Lemma 5]. If #5 = 2 and f is therefore given by two 
elements x,y € A, determining the kernel of f is the same as finding all relations 
between x and y that are of the form x*y” = 1 with (k,m) € Z?. In particular, if A 
is a cyclic group of order m generated by x, so that y = x“ for some a € Z, then the 
kernel of f is the subgroup generated by (m, 0) and (a, —1) € Z*. Determining kerf 
is therefore the same problem as computing the “discrete logarithm’ a of y. 

In general, the difficulty in computing the kernel of a group homomorphism f : 
ZS —+ A does not depend on the group structure of the finite abelian group A, but 
rather on the way it is presented. For instance, if A is the additive group Z/nZ of 
integers modulo n, presented in the usual way, the problem is very easy. It can be 
solved using at most O(#S) gcd computations in the ring Z, which can be efficiently 
calculated by means of the Euclidean algorithm. Indeed, when #5 = 1 the kernel 
of f : Z —> Z/nZ is generated by n/d, where d = gcd(f(1),n). For larger S one 
proceeds inductively, dealing in a similar way with one element of S at the time. On 
the other hand, if A is the cyclic multplicative group F; or the group E(F,) of an 
elliptic curve over F,, for some large p, there are no methods known to compute the 
kernel of a homomorphism f : Z‘ —> A that are of a comparable efficiency. 

Let f : ZS’ —> A be a homomorphism. Since both A and S are finite sets, the 
problem of determining the kernel of f is a finite issue. The straightforward naive 
algorithm to solve it, runs as follows. First assume that #5 = 1. In other words, 
we are given an element in x € A and we wish to compute its order. This can 
be done by computing the powers 1,x,x’,... of x until x’ is equal to the neutral 
element | € A. The exponent i is then the order of x and generates kerf. When S 
is larger and f is determined by elements x, y, z,... € A, we first list the elements 
of the subgroup H generated by x as above. Next we list all elements in the cosets 
y'H for i = 0,1,2,... The smallest exponent i for which y'H = H gives rise to 
a relation of the form y' = x/ and hence to an element in the kernel of f. Next 


406 R. Schoof 


one puts H = (x,y) and lists all elements of the cosets z'H ... etc. This method 
uses O(#A#S) operations in A. Since eventually all elements of A may be listed, the 
amount of memory required is O(#A). 

If we can compute a proper non-trivial subgroup B of A, then the problem of 
computing the kernel K of f : Z° —> A can be reduced to two similar problems 
involving the subgroup B and the quotient group B’ = A/B. More precisely, writing 
z for the canonical map A —> B’ and K’ for the kernel of the composite map 
a-f : ZS —> A — B, we have the following commutative diagram with exact 
rows and columns 


Oo 
Oo 


~ 


kerf —> K 


v v 


p=— eo 227g 
f f I 


Vv Vv 


6 2 = 4 — 


The group K’ is isomorphic to Z° for some finite set S’ having the same cardinality 
as S. The map f maps K’ to B. Since the kernel of the homomorphism f : K’ —> B, 
is isomorphic to K, we can compute K by first computing the kernel K’ of z -f : 
Z° —> B’ and then the kernel of f : K’ —> B. 

The groups B and B’ are smaller than A. Since B is contained in A, one 
can efficiently compose elements, calculate inverses and test for equality in B. 
Therefore, if one is also able to do this in the group B’ = A/B, then it is usually a 
good idea to make this reduction. This observation is due to Pohlig-Hellmann [16]. 
It applies for instance, if one knows a section j : B’ —> A of z so that A = B x B’. 
In this case equality tests in B’ can be performed in A. Another example is the case 
when A is cyclic and we know a proper divisor d of the order of n = #A. Then we can 
take B = dA and compute in B’ = A/dA exploiting the isomorphism B’ & (n/d)A 
given by multiplication by n/d. It follows that computing the kernel of f : Z’ —> A 
is relatively easy when all prime divisors of #A are small. Therefore, in this paper 
one should keep in mind groups A, for which #A is divisible by at least one large 
prime number. 

Next we describe a more efficient algorithm to compute the kernel of a 
homomorphism f : Z° —> A. It is the baby-step-giant-step algorithm due to Dan 
Shanks [20]. It is deterministic and uses O(#S/#A) operations and equality tests in 
the group A. It also requires the storage of O(/#A) elements of A. We explain the 
algorithm in the case #S = 1. For larger S, the idea remains the same, but, as in our 
description above of the naive algorithm, the details are more cumbersome to write 
down [3]. Any homomorphism f : Z —> A is determined by the element x = f(1) 
of A. Let a be the integer part of /#A + 1. We first make baby-steps. This means 
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that we make a list of the elements x! for 0 < i < a. If for some i in this range, x! 
is the neutral element of A, we are done: the smallest such 7 is the order of x and 
generates the kernel of f. If this is not the case, we make giant steps: we put y = x“ 
and compute y for 1 < j < a. Each time we check whether y’ is in the list that we 
made. In order to be able to do this efficiently, we assume that the elements in A 
are presented in some unique “reduced” way and that the list of the elements x’ for 
0 < i < ais sorted with respect to this presentation. Since the order of x is at most 
#A < a’, it is bound to happen that for some j, the element y’ is in the list. If it does, 


we have x1 = y' = x” for some i = 0,1,...,a. The first value of j for which this 
happens has the property that aj — i is the order of x and hence generates the kernel 
of f. 


There are also probabilistic algorithms to compute the kernel of f : Z° —> A. 
They have the same running time as the baby-step-giant-step algorithm. The 
advantage of the probabilistic algorithms is, that they do not require making lists 
of size /#A. We describe the so-called g-algorithm, due to John Pollard [17]. Once 
again we explain the algorithm only in the case #S = 1. In this case, we put x = f(1) 
and make a random, or rather pseudorandom, walk by evaluating elements of the 
form x” fori = 0,1,2,... with 1 = no < ny < nm.... This means that for 
each i, the next element x”'+! is computed in a pseudorandom fashion from the 
element x” € A. By the birthday paradox, one expects that x" = x” for two distinct 
values of i,j that are O(/#A). Moreover, this can be detected efficiently using the 
cycle detection algorithm, attributed to R.W. Floyd by Knuth [9, p. 7]. The order 
of x divides n; — nj. In practice the quotient is small, so that the order of x can be 
computed easily. 


3 Index Calculus 


Index calculus is a method to compute discrete logarithms and, more generally, 
to determine kernels of homomorphisms f : Z° —~ A, that applies when A is the 
multiplicative group of a finite field. In this section we assume that A = Fr for some 
large prime p. In the next section we consider the case where A is the multiplicative 
group of a finite field of small characteristic. 

Before considering the map f, we do a precomputation and use the index calculus 
algorithm to compute the kernel of 


~7T * 
Se Sal at 
where T is the set of the primes / < X for some bound X < p and h is the 


homomorphism that, for any prime / < X, maps the /th basis vector of Z’ to 
I (mod p). The kernel of h consists of the vectors (x;)ier € Z’ that satisfy 
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| [#=1inz 


leT 


and hence 


) > xjlog/ = 0 (mod p — 1), 


leT 


where log/ denotes the discrete logarithm of / with respect to any fixed primitive 
root in F;. The algorithm to determine ker / runs as follows. Pick random exponents 
e(J) > 0 with )>,<; e(/) bounded by some power of log p, that is sufficiently large 
in the sense that the products | [j<7 1 exceed p. Then check whether the remainder 
modulo p of []j¢7 /° is “X-smooth”. In other words, check whether its factorization 
in the ring Z is of the form [<7 4. If it is, we obtain the relation 


[ [°° = | [?®, in F*. 


leT leT 


It follows that the vector (e(/) — f(J))ies is in the kernel of h: 


Y\(e@) —fM) log =0, in Z/(p—1)Z. 


leT 


Repeating this procedure, we occasionally find that [[)<7 I? is X-smooth, hence 
obtain a non-trivial relation and thus a non-zero vector in the kernel of h. Once we 
have obtained a bit more than #T vectors, it is reasonable to expect that the vectors 
that we found, generate the kernel. 

It remains to choose the value of X. If X is very small with respect to p, there 
are very few X-smooth numbers in the set {1,2,...,p — 1}. Since the remainders 
of the products []j<7 I appear to be distributed randomly in this set, it is difficult 
to obtain relations and the algorithm may be time consuming. On the other hand, 
if X is very large, it is much easier to find X-smooth numbers and vectors in ker h. 
However, since we need more than #T relations, we need to find many more of them 
and the algorithm may also be time consuming. 

The optimal value of X is somewhere in the middle. It depends on the probability 
that a random natural number less than p is X-smooth. Writing X = p!/“ for some 
u > 1, this probability is roughly u~“. See [4]. A back of an envelope computation 
shows that the optimal value for u is approximately u = 2,/logp/ log log p. With 
this choice of u, computing the kernel of f involves 


exp(2/ log p log log p) 


elementary operations with numbers that have O(log p) digits. Therefore this is a 
subexponential algorithm. 
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With this choice of u, the set of primes / < X almost certainly generates F*, so 
that f is surjective. Therefore the induced map 


h: (Z/(@—1)Z)’ > F; 


is also surjective and hence split. This means that the kernel of h is the zero set of 
a single linear equation }°)<7 aX; = 0 (mod p — 1), with coefficients a; equal to 
log / with respect to the primitive root g that is given by g = [])<7/'. Here (ier 
is any vector for which one has 7 <7 a7) = 1 in Z/(p — 1)/Z. The equation can be 
computed efficiently using linear algebra over the ring Z/(p — 1)Z. This completes 
the description of the precomputation. 
In order to explain how to determine the kernel of the given homomorphism 
fi 
we consider first the case that #$ = 1. In this case f is determined by the element 
x = f(1) € F) and the kernel of f is generated by the order of x in the group F°. 
To compute the kernel, pick random products x | Tje7 I with T as above and check 
whether the factorization in Z of the remainder modulo p is of the form []j<7 4. If 
this happens, we obtain the relation 


x[ [re = [[”’. in F*. 


leT leT 


This implies that 


logx = (fe) logl, in Z/(p—1)Z. 


leT 


Since we already have computed log/ for every / € T, we can now evaluate log x. 
The order of x in the group F; is equal to the order of log x in the additive group 
Z/(p — 1)Z. It is therefore equal to p — 1 divided by gcd(p — 1, log x). 

The method for #S > 1 is based on this. One computes the discrete logarithm of 
f(s) for each element s € S. Composing f : Z5 —> FE with the discrete logarithm 
gives a homomorphism from Z) to the additive group Z/(p — 1)Z. As remarked 
above, determining the kernel of such a homomorphism is easy and can be done by 
means of linear algebra over Z. 


4 Finite Fields 


Recently there has been great progress in solving the discrete logarithm problem for 
finite fields of small characteristic. Indeed, in [1, 5, 7, 8] algorithms are described 
that almost run in polynomial time. As in the previous section, the algorithms 
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proceed by first computing the logarithms of a set of elements—the factor base— 
that are, in some sense, small. Next one uses this to solve the problem of computing 
the discrete logarithm of an individual element that is not in the factor base. Here 
we describe the first phase following Antoine Joux and his collaborators [1]. For the 
second phase we refer to the papers mentioned above for more details. 

As a typical example of the method, we discuss the case of a finite field of 
QO = q** elements, where g is a prime power and k is of the same order of magnitude 
as q. See [1] for a precise description of the range of finite fields for which the 
algorithm is effective. Let F,2 denote the subfield of q° elements of Fg. Note that if 
k and gq are approximately equal, q* is small with respect to Q = q?*. Therefore, 
making a list of all elements of F,2 can be done in time polynomial in log Q. 
As a consequence, computing discrete logarithms of elements in F*, can be done 
in time polynomial in log Q as well. Therefore, the Pohlig-Hellmann argument of 
Sect. 2 reduces the problem of computing discrete logarithms in the group F5 to the 
problem of computing discrete logarithms in the group A = FG/ F’,. 


We assume that the field with Q = q** elements is represented as F2[X]/(¢(X)), 
where $(X) is an irreducible degree k polynomial in F,2[X]. In order to have an 
efficient algorithm, the polynomial $(X) in Fg = F,2[X]/(¢(X)) is supposed to 
have a special shape: we want that 


X! = r(X) mod ¢(X), 


for some rational function r(X) € F,2(X) whose numerator and denominator have 
very small degrees. Examples are provided by the polynomials ¢(X) = X47! — g 
or XIt! — g, where g is a generator of the cyclic group F*,. In the first case we 
have r(X) = gX and in the second r(X) = g/X. See [23]. Numerical experiments 
suggest [1] that when k is close to q, one can find a rational function r(X) with 
denominator and numerator of degree at most 2, for which X47 — r(X) is divisible by 
an irreducible degree k polynomial ¢(X). Since an algorithm of Lenstra [10] allows 
one to compute an isomorphism between any presentation of the finite field Fg and 
F,2[X]/(¢(X)) in polynomial time, requiring $(X) to have this special shape, is not 
a serious restriction. 

In the first phase of the algorithm we compute the discrete logarithms of the 
elements in a factor base, which in this case consists of the images of all monic linear 
polynomials in F,2[X] in the group A = F2[X]/(¢(X))*/ F.- Putting T = F, this 
means that we compute the kernel of the homomorphism 


h:Z' A, 
that maps the basisvector e, corresponding to u € T to the image of X — wu in the 


group A = (F,2[X]/(¢(X)))"/F- 
The identity 
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x1-x = [](«-» 


ueF, 


implies that 


r(X)-X = [[(K-w, in Fe[X]/(6(X)). 


u€F, 


The denominator of the rational function r(X) —X is equal to the one of r(X). For the 
sake of exposition, we suppose that it factors into a product of linear polynomials in 
F2[X]. Its numerator has degree at most 3 and may or may not factor into a product 
of linear polynomials in F,2[X]. If it does, we obtain a multiplicative relation in the 
group A between the elements of our factor base. The relation gives then rise to an 
element in the kernel of the homomorphism h : Z7 —> A. 

In order to get more relations, we apply automorphisms of the fraction field 
F2(X) of F,2[X]. The group PGL2(F,2) acts on the right on F,2(X) as follows: 


aX +b 
cX +d 


ab 


f°(X) =f ) € PGLi(F,). 


), for any 0 = ( 


Cc 


Applying o € PGL(F,2) to the identity above, we obtain the equality 


(X°)? —X° = (XI-—X)’? = I] (X° —u), in F(X). 


u€F, 


The group PGL2(F,2) acts via linear fractional transformations on the left on the 
projective line P, and preserves the set of F,2-points P\(F,2). We view F,2 as a 
subset of P; (Fj). So we have P)(F,2) = F,2 U {oo}. For a function f € F2(X), a 
point u € P;(F,2) and an automorphism 0 € PGL2(F,2), we have f? (u) = f(o(u)). 


The above identity foro = é ’) € PGL2(F,2) then becomes 
c 
(cX + d)(aX +b) — (aX + b)(cX +d) = I] (X —u). 
u€o—!(P1(Fq))—{oo} 


It holds in the function field F,2(X) up to multiplication by some A € F*,. Note 
q 


that the set 0~'(P,(F,)) consists of g + 1 points and may or may not contain the 
point oo. Since X47 = r(X) modulo ¢(X), we have 


(aX +b)? =ar(X)+b and (cX +d)! =er(xX)+d 


in F2[X]/(¢(X)). Here we put ¢ = 14 fort € F,2. This leads to the following relation 
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(cX + d)(Gr(X) + b) — (aX + b)(Er(X) + d) = I] (X—u).  (*) 
uea—!(P}(Fq))—{oo} 

It holds in the ring F,2[X]/(¢(X)) up to multiplication by some A € F’,. The denom- 
inator of the left hand side of this equation is equal to the one of r(X). The numerator 
has degree at most 3 and it seems reasonable to expect that it varies randomly when 
we vary o. Numerical experiments have confirmed this [1]. Under this assumption 
a positive proportion factors into a product of linear polynomials in F,2[X]. Indeed, 
this proportion is approximately 1/6. For other choices of r(X), see [15]. A positive 
proportion of the relations (*) are therefore multiplicative relations between the 
elements X — u of the factor base in the group A = (F,2[X]/(¢(X)))*/ Fi. They 


give rise to elements in the kernel of the homomorphism h : Z’ —> A. 

The question is how many independent multiplicative relations between the 
elements X — u of the factor base we obtain in this way. The discrete logarithms 
of the elements X — u of the factor base are a solution of the system of linear 
equations that we obtain from the relations (+). There is at present no proof that 
we obtain sufficiently many relations for the linear system to have a unique solution 
over Z/MZ, where M = #A. However, there are some heuristic arguments in this 
direction that seem to be confirmed by experiments [1]. 

The subgroup of PGL2(F,2) that preserves the subset P,(F,) of P;(F,2), is equal 
to PGL2(F,). Therefore, the set ao! (P(F,)) and hence the right hand side of the 
relation («), depends only on the left coset o~'PGL2(F,) rather than on o7! itself. 
The number of cosets is equal to #PGL2(F,2)/#PGL2(F,) = q° + q. It follows that 
there are g* + q different subsets o~'(P)(F,)) and hence q? + q possibilities for 
the right hand sides of the relations (*). For a positive proportion the left hand sides 
of (*) factor into products of linear polynomials in F,2[X]. Therefore one expects 
many more than q’ relations between the elements X — u of the factor base, at least 
when q is not very small. As a consequence we obtain many more than q? elements 
in the kernel of the homomorphism h : Z7 —> A. 

The subsets o~'(P\(F,)) and therefore the right hand sides of the relations, 
are very different from one another. Indeed, it easy to see that any two distinct 
subsets o~'(P)(F,)) intersect in at most two points. Moreover, when o runs over 
representatives of the cosets of PGL2(F,) in PGL2(F,2) and P runs over the points 
of P\(F,2), the g? + q by q* + 1 matrix mg,p given by 


= 1, when P € 0 '(P\(F,)); 
7 0, otherwise. 

has maximal rank q? + 1. See [1]. In fact, it is not difficult to show that its rows 
span a subgroup of index g + 1 in Z? +1. The matrix of the homogeneous linear 
system we want to solve consists of the subset of rows of the matrix m,,p for which 
the left hand side of («) factors completely, somewhat perturbed by the few non- 
zero coefficients that come from the left hand side of (). Since the matrix mg, p has 
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maximal rank, it is perhaps not unreasonable to expect that our linear system has a 
unique solution over Z/MZ, where M = #A. 

It was pointed out by Wan et al. [5] that there is a problem with linear polynomials 
X —u that divide (X? — r(X))/@(X). Indeed, the relations that we find, not only hold 
modulo #(X), but also modulo X — u. Typically the multiplicity of X — u is the 
same on both sides of the relations (+). This cancellation implies that the logarithm 
of X — u does not appear in the linear system. Therefore it cannot be computed 
this way. For instance, in the case @(X) = X7~! — g, we have r(X) = gX and 
X4—gX = X¢(X). In this case, the logarithm of X may not at all appear in the linear 
system. In this special case however, computing the logarithm of X is easy since its 
order in the group A is g — 1, which is very small. See the original papers [1, 8] for 
ways to get around this problem in general. 


5 Elliptic Curves 


Elliptic curve cryptography is based on the difficulty of solving the discrete 
logarithm problem in the finite group E(F,) of points of an elliptic curve E over 
a finite field F,. More generally, determining the kernel of a homomorphism 


f: 25 — E@,), 


is a difficult problem. Apart from some exceptional situations that we describe 
below, the only methods that are available at present, are the baby-step-giant-step 
method and the Pollard 9-method that were discussed in Sect. 2. Since #E(F,) © q, 
both methods require O(,/q) operations in the group E(F,). This is much more than 
the number of operations required by the subexponential index calculus algorithm 
that was described in Sect. 3. Therefore, in cryptographical systems based on elliptic 
curves, key sizes can be made smaller, so that encryption and decryption algorithms 
are faster. 

Suppose that F is an elliptic curve over a finite field F, given by a Weierstrass 
equation 


Y* +a,X¥ +a3 = X? + aX? + a4X +. a6, 


with coefficients a; € F,. See [21] for the basic properties of elliptic curves. We let 
E (Fy) denote the group of points on EF with coordinates in an algebraic closure F, 
of F,. It is an infinite torsion group. The set E(F,) of points on £ with coordinates 
in F, is a finite subgroup. For every natural number n, we let E[n] denote the group 
of points on E that are annihilated by n. In other words, we have 


E|n] = {P € E(F,) : nP = 0}. 
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If n is not divisible by the characteristic p of F,, the group E[n] is isomorphic 
to Z/nZ x Z/nZ. The Weil pairing is a bilinear, antisymmetric and non-degenerate 
pairing 


én : En] x E[n] — pn. 


Here jz, denotes the subgroup of nth roots of unity of F. The pairing e, is Galois 
equivariant. 

Suppose now that the group E(F,) is cyclic of order n, coprime to p. Let O € E[n] 
be a point of order n with the property that the subgroup it generates has trivial 
intersection with E(F,). Then the map 


§: E(F,) — Ln 


given by g(P) = e,(P, Q) is an injective group homomorphism. It can be efficiently 
computed by means of an algorithm invented by Victor Miller [14]. In this way 
the kernel of f : Z5 —> E(F,) can be calculated by computing the kernel of the 
composite homomorphism 


ZS 1s E(B,) > pn @ Fy. 


Here d is the order of g modulo n. This approach is due to Menezes et al. [12]. It 
reduces the problem of computing the kernel of a homomorphism Z*° —> E(F,) to 
a similar problem involving the multiplicative group Fa rather than E(F,). 

Since the Weil pairing is Galois equivariant, the field of definition of the point Q 
contains Fj. Since d is typically very large, computing in the group E(F,) is very 
costly and this approach is usually not very successful. However, in certain special 
cases it can be very effective. An important example is provided by supersingular 
elliptic curves over prime fields F,. When p = 1 (mod 4), the group E(F,) of a 
supersingular curve E is cyclic of order p + 1. In this case j4»41 1s contained in an 
extension of F,, that has only degree d = 2. Therefore this method is very efficient. 
With small modifications it also works when p = 3 (mod 4) and more generally 
when the order of g modulo n = #E(F;) is small. In this situation, this use of the 
Weil pairing is an efficient way to compute discrete logarithms or, more generally, 
to compute the kernels of homomorphisms f : Z5 —> E(F,). 

An even faster algorithm was invented by Semaev [19] for elliptic curves E over 
prime fields F,,, for which the group E(F,) has order p. In this case an isomorphism 


g: Elp] — Z/pZ 
is constructed as follows. We fix a non-zero point Q in E(F,). For P € E(F,) we put 


- 


g(P) = fp 


(Q). 
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Here fp is a function on E whose divisor is equal to p(Q — co). We let fp denote 
the function for which we have the following equality of Kahler differentials: dfp = 
fpdX. The map g can be efficiently computed by means of Miller’s algorithm. In this 
way we can compute the kernel of f : Z’ —> E(F,) by computing the kernel of 


251, E(R,) "> Z/pZ. 


Similar related algorithms have been proposed by Satoh and Araki [18] and by 
Smart [22]. See Belding’s paper [2] for a relation between the algorithms described 
in this section. 
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Hadwiger’s Conjecture 


Paul Seymour 


Abstract This is a survey of Hadwiger’s conjecture from 1943, that for all t > 0, 
every graph either can be f-coloured, or has a subgraph that can be contracted to the 
complete graph on ¢t + 1 vertices. This is a tremendous strengthening of the four- 
colour theorem, and is probably the most famous open problem in graph theory. 


1 Introduction 


The four-colour conjecture (or theorem as it became in 1976), that every planar 
graph is 4-colourable, was the central open problem in graph theory for a hundred 
years; and its proof is still not satisfying, requiring as it does the extensive use 
of a computer. (Let us call it the 4CT.) We would very much like to know the 
“real” reason the 4CT is true; what exactly is it about planarity that implies that 
four colours suffice? Its statement is so simple and appealing that the massive case 
analysis of the computer proof surely cannot be the book proof. 

So there have been attempts to pare down its hypotheses to a minimum core, in 
the hope of hitting the essentials; to throw away planarity, and impose some weaker 
condition that still works, and perhaps works with greater transparency so we can 
comprehend it. This programme has not yet been successful, but it has given rise to 
some beautiful problems. 

Of these, the most far-reaching is Hadwiger’s conjecture. (One notable other 
attempt is Tutte’s 1966 conjecture [79] that every 2-edge-connected graph contain- 
ing no subdivision of the Petersen graph admits a “nowhere-zero 4-flow”, but that 
is beyond the scope of this survey.) Before we state it, we need a few definitions. 
All graphs in this paper have no loops or parallel edges, and are finite unless we say 
otherwise. If G is a graph, any graph that can be obtained by moving to a subgraph 
of G and then contracting edges is called a minor of G. The complete graph on t 
vertices is denoted by K;, and the complete bipartite graph with sides of cardinalities 
a, bis denoted by Kap. 
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By the Kuratowski-Wagner theorem [56, 83], planar graphs are precisely the 
graphs that do not contain Ks or K3,3 as a minor; so the 4CT says that every graph 
with no Ks or K3,3 minor is 4-colourable. If we are searching for the “real” reason 
for the four-colour theorem, then it is natural to exclude Ks here, because it is not 
four-colourable; but why are we excluding K3.3? What if we just exclude Ks, are all 
graphs with no Ks minor four-colourable? And does the analogous statement hold if 
we change Ks; to K;,, and four-colouring to f-colouring? That conjecture was posed 
by Hadwiger in 1943 [37] and is still open: 


Hadwiger’s Conjecture 1.1. For every integer t => 0, every graph with no K,+1 
minor is t-colourable. 


Let HC(f) denote the statement “every graph with no K;+; minor is f-colourable”’. 
Hadwiger proved HC(t) for t < 3 in 1943 when he introduced his conjecture. 
Wagner [83] had already shown that HC(4) is equivalent to the 4CT in 1937; and so 
HC(4) was finally proved when the 4CT was proved by Appel and Haken [4, 5] in 
1976. Then in 1993, Robertson, Thomas and I proved HC(5) [71]; one step further 
than the 4CT! And the proof did not use a computer (although it did assume the 4CT 
itself). HC(6) remains open. 

There have been numerous weakenings and variations proved, of various types, 
and strengthenings proposed, some of which still survive; and this is an attempt 
to survey them. Incidentally, there is an excellent 1996 survey on Hadwiger’s 
conjecture by Toft [78], which is particularly informative on the early history of 
the problem. 


2 The Proved Special Cases 


Let us first go through the results just mentioned more carefully. HC(O) and HC(1) 
are trivial. Graphs with no K3 minor are forests, which are 2-colourable, so HC(2) 
holds. The first case that is not quite obvious is HC(3). How do we show that graphs 
with no Ky, minor are 3-colourable? Hadwiger [37] showed that every non-null graph 
with no K4 minor has a vertex of degree at most two, which implies that all such 
graphs are 3-colourable; and there are later theorems of Dirac [22] and Duffin [24] 
on the same topic. This assembly of results can be expressed in several different 
ways, but here is one that is convenient for us. Take two graphs G;, G2, and for 
i = 1,2 let C; be a clique (that is, a subset of vertices, all pairwise adjacent) of Gj, 
where |C;| = |C2|. Choose some bijection between the cliques, and identify each 
vertex of C; with the corresponding vertex of C7. We obtain a graph H say, with two 
subgraphs isomorphic to G;, G2 respectively, overlapping on a clique. Now let G be 
obtained from H by deleting some edges (or none) of the clique; we say that G is a 
clique-sum of G;, G2, and if the clique has size k, we also call it a k-sum. 

It is easy to see that if G is a clique-sum of G;, G2, and both G;, G2 are 
t-colourable, then so is G. So if G can be built by repeated clique-sums starting 
from some basic class of graphs that are all t-colourable, then so is G. This gives us 
a slick proof of HC(t) for t < 3, because of the following: 
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Theorem 2.1. For 0 < t < 3, the graphs with no K,4, minor are precisely the 
graphs that can be built by repeated clique-sums, starting from graphs with at most 
t vertices. 


HC(4) implies the 4CT, so we should not expect Theorem 2.1 to extend to t = 4. 
And it doesn’t; large grids have no Ks; minor and yet cannot be built from 4-vertex 
graphs by clique-sums. (Indeed, let us say G has tree-width k if k is minimum 
such that G can be built by clique-sums from pieces with at most k + 1 vertices; 
then the n x n grid has tree-width n.) Nevertheless, we can describe all the graphs 
with no Ks minor in this language. Let Vg be the graph obtained from a cycle of 
length 8 by adding four edges joining the four opposite pairs of vertices of the cycle. 
Wagner [83] essentially proved the following in 1937. 


Theorem 2.2. The graphs with no Ks minor are precisely the graphs that can 
be built by repeated O0-, 1-, 2-, and 3-sums, starting from planar graphs and 
copies of V3. 


Consequently the 4CT implies HC(4), as Wagner points out in his 1937 
paper [83]. (Of course, this does not yet provide the profound insight into the four- 
colour theorem we hope for, because not only does the proof of HC(4) use the 4CT, 
but the graphs it concerns are themselves basically planar.) 

What about HC(5)? One might imagine that since the curve of difficulty versus ¢ 
has recently had such a steep slope, HC(5) would be impossible (or false); but that 
is not so. Suppose it is false, and let G be a smallest counterexample. Robertson 
et al. [71] showed, without using a computer and without assuming the four- 
colour theorem, that G must be an apex graph, that is, there is a vertex whose 
deletion makes it planar. If so, then since the 4CT implies that the planar part of 
G is 4-colourable, we still have a colour left for the vertex we deleted, so G is 
5-colourable after all. 

The proof that G is apex is (very roughly) as follows. One can show that G is 
6-connected, and in particular all vertices have degree at least six; and vertices of 
degree six belong to K4 subgraphs, and it follows that there are not many of them 
(in fact at most two), or else we could piece together all these K4’s to make a Ke 
minor. On the other hand, a theorem of Mader says that the average degree of G is 
less than eight, and we cannot make the average degree bigger then eight even if we 
cleverly contract edges. That implies that there are edges that are in several triangles 
or squares. If, say, there is an edge uv in four triangles, then there is no K4 minor of 
G \ {u, v} on the four surviving vertices of the triangles (since G has no K¢ minor), 
and graphs with this property are well-understood; basically they have to be planar 
with the four special vertices on the infinite region. So G \ {u, v} is planar, and now 
a little more thought shows that one of G \ u, G \ v is planar, and hence G is apex. 

Proving that graphs with no K7-minor are 6-colourable is thus the first case of 
Hadwiger’s conjecture that is still open. Albar and Gongalves[2] proved: 


Theorem 2.3. Every graph with no K7 minor is 8-colourable, and every graph with 
no Kg minor is 10-colourable. 
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Since we are stuck trying to prove Hadwiger’s conjecture itself, let us see what 
we can show about the chromatic number of graphs with no K,+4; minor. As 
Wagner [82] proved in 1964, all graphs with no K,; minor are 2'-colourable. The 
proof is as follows: we may assume G is connected; fix some vertex z, and for each 
i let L; be the set of vertices at distance i from z; since G has no K,+; minor, the 
subgraph induced on L; has no K; minor (because the union of all the earlier levels 
would provide one more vertex in the minor); inductively each level L; induces a 
subgraph that is 2'~'-colourable; and now alternate colours in even and odd levels 
to get a 2’-colouring of G. 

Wagner’s result has been considerably improved, but most of these improvements 
depend on “degeneracy”, so let us first discuss that. We say G is k-degenerate if 
every non-null subgraph has a vertex of degree at most k. For instance, forests are 
1-degenerate, series-parallel graphs (the graphs with no K4 minor) are 2-degenerate, 
and planar graphs are 5-degenerate. By deleting a vertex of degree at most k and 
applying an inductive hypothesis, we have: 


Theorem 3.1. /f G is k-degenerate then its chromatic number is at most k + 1. 


So, if we can bound the degeneracy of the graphs with no K;,; minor, we also 
bound their chromatic number. (This gives us another proof of HC(f) for t < 3, 
because for t < 3 every graph with no K;41 minor is (t — 1)-degenerate.) 

The simplest way to bound the degeneracy is to bound the average degree. How 
many edges an v-vertex graph with no K; minor can have is a much-studied question. 
Mader [59, 60] showed in 1967 that: 


Theorem 3.2. For every graph H there exists c such that |E(G)| < c|V(G)| for 
every graph G with no H minor. 


But when H = K, for small values of t, we know the answer exactly: 


¢ forn > 1, n-vertex graphs with no K3 minor (forests) have at most n — 1 edges; 
¢ forn > 2, graphs with no Ky, minor have at most 2n — 3 edges; 
¢ forn > 3, graphs with no Ks; minor have at most 3n — 6 edges. 


Here is an example: for n > t — 2, take the complete bipartite graph K)~2 n-;+42, 
and add edges joining all pairs of vertices on the side of cardinality t — 2. This has 
no K; minor, and has n vertices and (ft —2)n— (t— 1)(t—2)/2 edges. Thus for t < 5, 
this graph has the maximum number of edges possible, and if this were so for all f, 
it would prove Hadwiger’s conjecture within a factor of 2. Mader [60] showed that 
the same holds for t = 6, 7: 


Theorem 3.3. Fort < 7 andalln > t—2, every n-vertex graph G with no K; minor 
Satisfies 


|E(G)| < (¢—2)n— (¢- 1)¢—2)/2. 
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But for tf > 8 the pattern fails. If n),...,n; > 0, we denote by K;y,_..n, the 
complete f-partite graph with parts of cardinality n;,...,m,. Mader pointed out that 
K22.2.2.2 has no Kg minor, and does not satisfy the formula of Theorem 3.3. 

On the other hand, for tf = 8 we understand all counterexamples to the 
formula. In the definition of a k-sum we are permitted to delete edges from the 
clique involved; if we do not delete any such edges let us call it a pure k-sum. 
Jgrgensen [39] proved: 


Theorem 3.4. Let G be an n-vertex graph with no Kg minor, with n > 6 and 
|E(G)| > 6n — 21; then |E(G)| = 6n — 20, and G can be built by pure 5-sums 
from copies of K.2.2.2.2. 


The same holds for Ko; Song and Thomas [72] proved: 


Theorem 3.5. Let G be an n-vertex graph with no Ko minor, with n > 7 and 
|E(G)| > 7n — 28; then |E(G)| = 7n — 27, and either G = Kz 2233, or G can 
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But as ¢ grows, the formula of Theorem 3.3 becomes completely wrong. 
For a graph H, let #(H) be the infimum of all d such that every graph G 
with no H minor has average degree at most d, that is, satisfies |E(G)| < 
d|V(G)|/2. (We are particularly concerned here with the case when H is a 
complete graph K;, but ¢(#) is of interest for non-complete graphs too.) Kostochka 
[49, 51] and Fernandez de la Vega [28] proved that @(K;) is at least of order 
t(log t)'/ 2, and Kostochka [49, 51] and Thomason [73] proved the same was an 
upper bound; and in particular Kostochka [51] showed (logarithms are to base e): 


Theorem 3.6. For every integer t > 4, 6 = $(K;) satisfies: 


0.0326 
(log(p/2))'/? ~~ 


Later Thomason [74] found the limit exactly: he proved (again with logarithms 
to base e): 


Theorem 3.7. Let A < 1 be the solution of the equation 1 — 4 + 2AlogA = 0 
and let 


a = (1—A)log(1/A)~!/? = 0.63817. 


Then as t > 00, (K;) = (a + o(1))t(log t)!/?. 


This was extended to non-complete graphs by Myers and Thomason [63], who 
proved the following (IR* denotes the set of nonnegative real numbers, and @ is as 
before): 
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Theorem 3.8. Let H be a graph with t vertices, and let y(H) be the minimum of 
ue) w(u) over all functions w : V(H) — R* such that 


Ye perv) <t. 


uv€E(A) 


1 
ri 


Then as t > 00, 6(H) = (ay(A) + o(1))t(log t)!/?. 


For classes of graphs H with y(H) bounded away from zero (such as regular 
graphs with degree ct* where c,€ > 0), this determines ¢(H) asymptotically; but 
for some classes of graphs (such as those with a linear number of edges) it does not. 
This gap is addressed by two theorems of Reed and Wood [67]: 


Theorem 3.9. There is a constant dy such that ¢(H) < 3.895(log d)'/*t for every 
graph H with t vertices and average degree d = do. 


Theorem 3.10. For every graph H, $(H) < |V(A)| + 6.291|E(A)|. 


The Myers-Thomason theorem implies that ¢(H) is not linear in ¢ for graphs 
with f vertices and with a quadratic number of edges; but the second Reed-Wood 
theorem implies that if |E(H)| is linear in ¢ then so is f(A). 

For some graphs H we can determine exactly the maximum number of edges in 
graphs with no H minor, but those theorems are thinner on the ground. We already 
mentioned the cases when H = K;,; and the same can be done for many graphs 
H with at most six vertices, such as K33; and there are two theorems doing it for 
larger graphs H. Chudnovsky et al. [14] answered it for Ky, (extending a result of 
Myers [62]), and Kostochka and Prince [52] did it for K3, (and the K,, result is 
obvious): 


Theorem 3.11. Let G be an n-vertex graph with no H minor. 


° IfH = K,, then |E(G)| < $(t— 1); 
* ifH = Kp, then |E(G)| < 3(t+ 1)(n— 1); and 
° ifH = K3, and t > 6300 andn > t+ 3 then |E(G)| < (t+ 3)(n—2) + 1. 


All three results are exact for infinitely many values of n. (By the way, when 
H = Kj,,, if we restrict to connected graphs G then the answer is quite different, 
namely |E(G)| <n+ (t+ 1)(t—2)/2 ifn = t + 2; see [20].) 

What about H = K,, in general, if t > s? For fixed s and large ¢, the value of 
(Ks) is not determined by Theorem 3.8, so this is an interesting case. It turns out 
to be more natural to exclude K;, instead; this is the graph obtained from K,, by 
adding edges joining all pairs of vertices in the side of cardinality s. Extrapolating 
from Theorem 3.11, one might hope that if an n-vertex graph has no K,, minor then 


IK(G)| < 5Qs+1-3)n—F(s— str), 
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because again this can be attained with equality for infinitely many n (take many 
disjoint copies of K, and add s — 1 extra vertices adjacent to everything). But this 
is not true, at least for s > 18. Kostochka and Prince [53, 54] proved (and see 
also [55] for a related result) that with the function ¢(H) as before (here logarithms 
are binary): 


Theorem 3.12. Let s, t be positive integers with t > (180s log s)'!*% £5, Then 
3s — 5s? +t < b(Ks1) < O(K*,) < 35 +1. 


All these results tell us that the graphs with a certain minor H excluded have 
average degree at most some constant, and therefore have minimum degree at most 
the same constant; and that gives us a bound on their degeneracy. In particular, from 
Theorem 3.6, every graph with no K;, minor has degeneracy at most O(t(log t)!/), 
and therefore chromatic number at most the same. For large f, this is the best bound 
known on the chromatic number of graphs excluding K;. 

Incidentally, bounding minimum degree by average degree is natural, but it might 
not give the right answer. For instance, graphs with no K4 minor can have average 
degree > 3; and yet they always have minimum degree at most 2. When we exclude 
Ks, average degree gives the true bound for minimum degree; but what happens with 
Ke? Graphs with no Kg minor can have average degree more than 7, but must they 
have minimum degree at most 6? I think this is open. 


4 Stability Number 


One possible cause of the intractability of Hadwiger’s conjecture is that we need to 
use the fact that the chromatic number is large, and graphs can have large chromatic 
number for obscure reasons. What if we make our lives easier, and look at graphs 
that have large chromatic number for obvious reasons? The stability number a(G) 
of a graph G is the size of the largest stable set (a set of vertices is stable if no two of 
its members are adjacent). (This is different from Thomason’s a, which we do not 
need any more.) Every n-vertex graph G has chromatic number at least [n/a(G)], 
and should contain a clique minor of this size if Hadwiger’s conjecture is true. Can 
we prove this at least? 

The signs are not good; the only known proof that every n-vertex planar graph 
has stability number at least n/4 is via the 4CT. Nevertheless, there are some results. 
There is an elegant argument by Duchet and Meyniel [23] proving: 


Theorem 4.1. Every n-vertex graph G has a K; minor where t > n/(20(G) — 1). 
Their argument can also be used to show a result that seems to have been overlooked: 


Theorem 4.2. For every graph G with no K,4, minor, there exists a t-colourable 
induced subgraph containing at least half the vertices of G. 
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Theorem 4.1 is within a factor of 2 of what should be true, and there have been 
subsequent improvements, notably by Fox [29] (who proved a factor slightly less 
than 2) and then Balogh and Kostochka [6], who reduced Fox’s factor a little further 
and currently have the record. They showed the following: 


Theorem 4.3. Every n-vertex graph G has a K, minor where t > 0.51338n/a(G). 


A different strengthening, better than Theorem 4.3 when @ is small, was proved 
by Kawarabayashi and Song [47]: 


Theorem 4.4. Every n-vertex graph G with a(G) => 3 has a K; minor where t = 
n/(2a(G) — 2). 


Returning to Theorem 4.1: it implies that if G has no K;4, minor then some stable 
set has cardinality at least n/(2t). Suppose we give each vertex of G a nonnegative 
real weight. Hadwiger’s conjecture would imply that there is a stable set such that 
the total weight of its members is at least 1/t times the sum of all weights. One 
might hope to prove a weighted version of Theorem 4.1 (without the —1 in the 
denominator) and this turns out to be true, though more difficult to prove. Say the 
fractional chromatic number of a graph G is the minimum real number k such that 
for some integer s > 0, there is a list of ks stable sets of G such that every vertex is in 
s of them. Via linear programming duality, the weighted Duchet-Meyniel statement 
is equivalent to the following, proved by Reed and myself [65]: 


Theorem 4.5. Every graph with no K,4, minor has fractional chromatic number at 
most 2t. 


The proof also gives a corresponding extension of Theorem 4.2: 


Theorem 4.6. Jn every graph G with no K,4, minor, there is a non-null list of t- 
colourable subsets of V(G), such that every vertex is in exactly half of the sets in the 
list. 


Graphs G with w(G) = 2 are particularly interesting, because these graphs are 
more tractable for colouring; for instance, there is a polynomial-time algorithm to 
find the chromatic number of such a graph (just find the largest matching in the 
complement graph). Here is another nice feature of them: say a seagull in a graph 
G is an induced 3-vertex path. If #(G) = 2 and S is a seagull in G then every other 
vertex of G has a neighbour in S, and so finding many disjoint seagulls is a way 
to find a large clique minor. In [15], Chudnovsky and I proved there is a min-max 
formula for the maximum number of disjoint seagulls in a graph G with a(G) = 2. 

For an n-vertex graph G with a(G) = 2, the Duchet-Meyniel theorem implies 
that there is a K, minor with t > n/3. This was strengthened by Béhme et al. [9], 
who proved (for graphs with arbitrary stability number): 


Theorem 4.7. Every n-vertex graph with chromatic number k has a K, minor where 
t > (4k —n)/3. 


But Hadwiger’s conjecture implies that if w@(G) = 2 then there should be a 
K, minor with t > n/2. This seems to me to be an excellent place to look for a 
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counterexample. My own belief is, if it is true for graphs with stability number two 
then it is probably true in general, so it would be very nice to decide this case. 
Despite some intensive effort the following remains open: 


Open Question 4.8. Does there exists c > : such that every graph G with a(G) = 
2 has a K, minor where t > c|\V(G)|? 


A graph is claw-free if no vertex has three pairwise nonadjacent neighbours. Thus 
graphs with stability number two are claw-free. Fradkin [31] proved: 


Theorem 4.9. Every n-vertex connected claw-free graph G with a(G) > 3 has a 
K, minor where t > n/a(G). 


Chudnovsky and Fradkin [13] proved: 
Theorem 4.10. Every claw-free graph G with no K,4, minor is |3t/2|-colourable. 


Line graphs are claw-free, so these last two results are related to a theorem of 
Reed and myself; we proved [66] that Hadwiger’s conjecture is true for line graphs 
(of multigraphs). 


5 Weakenings 


The statement of Hadwiger’s conjecture is: 

For all t = 0 and every graph G, either G has a K;4, minor or V(G) can be 
partitioned into t stable sets. 

How can we weaken this and still have something non-trivial? Section 2 covered 
changing “For all tf > 0” to “For a few t > 0”; Sect.3 did changing “t stable 
sets” to “f(t) stable sets”; and Sect. 4 covered changing “partitioned into stable sets” 
to “fractional chromatic number”; but there are several other ways to weaken the 
statement. Here are some. 


Change “every graph G”’ to “almost every graph G” (The meaning of “almost 
every” here is that the proportion of n-vertex graphs that satisfy the statement tends 
to 1 as n — oo.) This weakening is true. It follows from a combination of a theorem 
of Bollobas et al. [10] and a theorem of Grimmett and McDiarmid [35]: 


Theorem 5.1. For all d > 2, almost every n-vertex graph has a K, minor where 
t > n/((logn)'/? + 4), and has chromatic number at most 2n/ log n. 


Change “K;,41’ to something else If we hope to prove that every graph with no H 
minor has chromatic number at most t, then H had better have at most t+ 1 vertices, 
or else taking G = K,+, is acounterexample. So, which subgraphs H of K,4, work? 
Kostochka [48, 50] proved the following. 
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Theorem 5.2. For all s there exists to such that for all t > to, every graph with no 
K¥, minor is (s + t — 1)-colourable. 


Change “stable sets” to something else Here is a recent theorem of Edwards 
et al. [26]: 


Theorem 5.3. For all t there exists k such that if G has no K,4, minor then V(G) 
can be partitioned into t sets X,,...,X,, such that for 1 <i < t, G[X;] has maximum 
degree at most k. 


(For X € V(G), G[X] denotes the subgraph induced on X.) This result is quite 
easy, but it has two attractive features; first, it is best possible in that if we ask for a 
partition into f— 1 sets there is no such k; and second, it and Theorem 6.8 below are 
the only results known that derive a partition into ¢ sets with any non-trivial property 
from the absence of a K;4; minor. 

There are more weakenings to describe yet, but they deserve a new section. 


6 Bounded Component-Size 


What if we try to improve Theorem 5.3? Let us say X C V(G) has component-size 
k if the largest component of G[X] has k vertices. Thus having bounded component- 
size is more restrictive than have bounded maximum degree, though less than what 
we really want, being stable. Instead of just saying that each G[X;] has bounded 
maximum degree, what if we ask that each of them has bounded component-size? 
It has not been proved that for graphs G with no K;+; minor, we can partition into 
t sets with this property, but there has been a series of papers proving that V(G) 
can be partitioned into a linear number of parts each with bounded component-size. 
Initially Kawarabayashi and Mohar [43] proved: 


Theorem 6.1. For all t => 0 there exists k such that if G has no K, minor, then V(G) 
can be partitioned into at most f (t) parts each with component-size at most k, where 


f(D) = [311/21]. 


Wood [84] proved the same with f(t) = [7t/2 — 3/2] (using Theorem 10.6, an 
unpublished theorem of Norin and Thomas which we discuss later), and there have 
been further improvements which we describe below, culminating in Norin’s result 
that the same holds with f(t) = 2(t— 1). 

There is a set of lemmas here that can be combined in various ways. DeVos 
et al. [16] proved: 


Theorem 6.2. For all t there exists w such that for every graph G with no K; minor, 
there is a partition of V(G) into two parts, such that the subgraph induced on each 
part has tree-width at most w. 
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Alon et al. [3] showed: 


Theorem 6.3. For all w,d and for every graph G with tree-width at most w > 3 
and maximum degree at most d = |, there is a partition of V(G) into two parts each 
with component-size at most 24wd. 


Wood [85] improved this, replacing 24kd with 5(k+ 1)(7d—2)/4. Liu (unpublished) 
has recently proved a list-colouring version: 


Theorem 6.4. For all w,d there exists k such that for every graph G with tree- 
width at most w and maximum degree at most d, and every assignment of a set Ly 
with |Ly| = 2 to each vertex v, there is a choice of c(v) € Ly for each v such that 
for each x, the set of all vertices v with c(v) = x has component-size at most k. 


Incidentally, an interesting asymmetric version was proved by Ding and Dzio- 
biak [19]: 


Theorem 6.5. For all t > 0 there exists w = 0 such that for every graph G with no 
K, minor, V(G) can be partitioned into two sets X, Y, where G|X] has tree-width at 
most w, and G[Y] is (t + 1)-degenerate. 


By combining Theorems 6.2 and 6.3, Alon et al. deduced: 


Theorem 6.6. For all t,d there exists k such that for every graph G with no K, 
minor and maximum degree at most d, there is a partition of V(G) into four parts 
each with component-size at most k. 


Recently, Liu and Oum [58] improved this, replacing “four” by “three”. If 
we then combine their result with Theorem 5.3 we deduce an improvement of 
Theorem 6.1 with f(t) = 3(t — 1). Even more recently, Norin used a different 
approach to do better. He proved the following lemma [64]: 


Theorem 6.7. For all t,w = 0 there exists N with the following property. Let G be 
a graph with |V(G)| = N, with tree-width at most w and with no K, minor. Then for 
every S C V(G) with |S| < 2w, there exists I C V(G) \ S, nonempty, such that at 
most 2w vertices in V(G) \I have a neighbour in I, and every vertex in I has at most 
t — 2 neighbours in V(G) \ I. 


With the aid of this lemma, an easy inductive argument yields: 


Theorem 6.8. For all t, w > 0 there exists k such that for every graph G with tree- 
width at most w and no K;, minor, there is a partition of V(G) into t — 1 parts such 
that each part has component-size at most k. 


Then this, combined with Theorem 6.2, yields an improvement of Theorem 6.1 
with f(t) = 2(t— 1). 

Dvorak and Norin have now announced a proof of Theorem 6.1 with f(4) = t—1, 
the best possible. 
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7 Odd Minors 


We have finished with weakenings of Hadwiger’s conjecture now; time to turn to 
strengthenings. 

Graphs that are not 2-colourable not only have a K3 minor (or equivalently, a 
cycle); they have an odd cycle. It is tempting to try to make some corresponding 
strengthening of Hadwiger’s conjecture. Here is what seems to be the most natural 
way to do it. If G is a graph and X C V(G), 6(X) denotes the set of edges of G 
with one end in X and the other in V(G) \ X. We say that F C E(G) is a cut of G 
if F = 6(X) for some X C V(G). Now let G, H be graphs. We say that H is an odd 
minor of G if H can be obtained from a subgraph G’ of G by contracting a set of 
edges that is a cut of G’. (Note that @ is a cut.) Thus a graph is not 2-colourable if 
and only if it contains K3 as an odd minor. In 1979, Catlin [12] proved: 


Theorem 7.1. Jf G has no K4 odd minor then G is 3-colourable. 


Incidentally, a much stronger statement than this has now been proved. Say a 
fully odd K4 in G is a subgraph of G which is obtained from K4 by replacing each 
edge of K4 by a path of odd length (the /ength of a path is the number of edges in it) 
in such a way that the interiors of these six paths are disjoint. Toft [77] conjectured 
in 1975 and Zang [86] proved in 1998 (and, independently, Thomassen [76] proved 
in 2001) that: 


Theorem 7.2. [f G contains no fully odd K4 then G is 3-colourable. 


Returning to odd minors: there is a result giving a construction for all graphs 
with no K, odd minor, due to Lovasz, Schrijver, Truemper and myself. It is rather 
awkward to state, and not published, although it was proved many years ago (in the 
early 1980s, and mostly on a riverboat in Bonn, if I remember correctly). We omit 
its statement here; see [33]. 

In view of its truth for ¢ < 3, Gerards and I conjectured the following 
strengthening of Hadwiger’s conjecture (see [38]): 


Conjecture 7.3. For every t > 0, if G has no K,4, odd minor, then G is 
t-colourable. 


Several of the results mentioned earlier approaching Hadwiger’s conjecture have 
extensions to odd minors. For instance, Guenin [36] announced at a meeting in 
Oberwolfach in 2005 that: 


Theorem 7.4. Every graph with no Ks odd minor is 4-colourable. 
Geelen et al. [32] proved (see also [41] for a simpler proof): 
Theorem 7.5. [If G has no K, odd minor then y(G) < O(t(logt)!/). 


Kawarabayashi and Song [47] proved an odd minor version of Theorem 4.1, 
namely: 


Theorem 7.6. Every n-vertex graph G has a K; odd minor where t=>n/(2a0(G)—1). 
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Kawarabayashi and Reed [45] proved: 
Theorem 7.7. Every graph with no K; odd minor is fractionally 2t-colourable. 
Kawarabayashi and Song [47] proved an odd minor relative of Theorem 10.2: 


Theorem 7.8. For every t > 0, there exists N such that for every (496t + 13)- 
connected graph G with at least N vertices, either G has a K; odd minor, or there 
exists X © V(G) with |X| < 8t such that G \ X is bipartite. 


Kawarabayashi [40] proved: 


Theorem 7.9. If G has no K, odd minor, then there is a partition of V(G) into 496t 
parts such that each part induces a subgraph of bounded maximum degree. 


8 Other Strengthenings 


There have been some other strengthenings of Hadwiger’s conjecture proposed, 
but they have mostly not fared so well as Conjecture 7.3. For instance, say G is 
a subdivision of H if G can be obtained from H by replacing each edge by a path, 
where the paths have disjoint interiors. Hajés conjectured in the 1940s (but did not 
publish it) that 


False Conjecture 8.1. For all t > 0, ifno subgraph of G is a subdivision of Ki+1 
then G is t-colourable. 


This is true for t < 3, but it is still open for t = 4,5, and Catlin [12] gave a 
counterexample for all ¢ > 6. Indeed, Erdés and Fajtlowicz [27] proved that almost 
all graphs are counterexamples, because of the following: 


Theorem 8.2. There are constants C,, Cz such that for almost every n-vertex graph 
G, no subgraph of G is a subdivision of K, for t > C\n'/?, and the chromatic number 
of G is at least Cyn/ log(n). 


Erd6és and Fajtlowicz conjectured and Fox et al. [30] proved that the ratio of 
the chromatic number over the clique subdivision number over all n-vertex graphs 
is maximized up to a constant factor by the random graph on n vertices; in other 
words, the uniform random graph is essentially the strongest counterexample to the 
Hajés conjecture. 

Another conjectured strengthening of Hadwiger’s conjecture was proposed by 
Borowiecki [11]. A graph G is t-choosable if for every assignment of a t-element 
set L, to each vertex v of G, it is possible to select a member c(v) € L, for each v 
such that c(u) 4 c(v) if u, v are adjacent. Borowiecki asked whether 


False Conjecture 8.3. Every graph with no K,+, minor is t-choosable. 


This is true for tf < 3, but false for t = 4; Voigt [81] gave a planar graph that is 
not 4-choosable. Thomassen [75] proved that all planar graphs are 5-choosable, but 
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an additive constant adjustment is not enough to repair the conjecture in general; 
Barat et al. [7] showed that for all t > 1 there is a graph with no K3,42 minor that is 
not 4t-choosable. Kawarabayashi and Mohar [43] conjectured: 


Conjecture 8.4. For ail t, every graph with no K, minor is 3t/2-choosable. 
A third strengthening was proposed by Ding et al. [21]: 


Conjecture 8.5. For all integers t > s > 2, if G has no K, minor, then there is a 
partition of V(G) to t—s + 1 parts, such that the subgraph induced on each part 
has no K, minor. 


For s = 2 this is Hadwiger’s conjecture, but it has not been disproved for any values 
of s,t. For s > t— 1 it is easy, and it was proved for s = t — 2 by Goncalves [34]. 
Reed and I proposed a fourth variation in [65] (this one is not a strengthening): 


Conjecture 8.6. For every graph, there is a partition of its vertex set, such that 
each part induces a connected bipartite graph, and contracting each part to a vertex 
yields a graph with no induced cycle of length more than three. 


This would imply that all graphs with no K,,, minor are 2t-colourable. It remains 
open. 

A fifth possible extension is to infinite graphs. (Henceforth, graphs may be 
infinite in this section.) By compactness, if ¢ is an integer and HC(f) holds for finite 
graphs then it also holds for infinite graphs; but we could try to extend Hadwiger’s 
conjecture to allow infinitely many colours. One might hope that 


False Conjecture 8.7. For every cardinal t, every graph with no K, minor has 
chromatic number less than t; 


but this is trivially false (let G be the disjoint union of infinitely many finite cliques, 
one of each size; then G cannot be coloured with finitely many colours, but has no 
infinite clique minor). A better formulation is: 


Conjecture 8.8. For every cardinal t, let s be the least cardinal larger than t; every 
graph with no K, minor is t-colourable. 


I believe Conjecture 8.8 remains open, but van der Zypen [80] proved the 
following: 


Theorem 8.9. For every infinite cardinal t, every graph with no subgraph which is 
a subdivision of K; is t-colourable. 


Van der Zypen’s proof uses the fact that when f is an infinite cardinal, one can 
give a construction of the graphs that contain no subdivision of K;, a result due to 
Robertson et al. [70]. It is also possible [69] to do the same for graphs that contain 
no K; minor when f is an infinite cardinal. 
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9 Immersions 


There is an interesting conjecture, parallel to Hadwiger’s conjecture, that was 
proposed by Lescure and Meyniel [57], (and independently, by Abu-Khzam and 
Langston [1], later). Let G,H be graphs. An immersion of H in G is a choice 
n(v) € V(G) for each v € V(A), all distinct, and a choice n(e) for each e € E(G), 
where for e = uv, n(e) is a path of G between n(u) and n(v), and all the paths 
n(e) are pairwise edge-disjoint (they may share vertices; and an end-point of one 
path may be an internal vertex of another). Let us say G immerses H if there is an 
immersion of H in G. Lescure and Meyniel proposed: 


Conjecture 9.1. For every integer t => 0, every graph that does not immerse Ki+1 
is t-colourable. 


This neither implies nor is implied by Hadwiger’s conjecture, since immersing 
K,+1 neither implies nor is implied by having a K;+; minor; but it is in some respects 
similar. (In one respect it is very different: planar graphs can immerse huge complete 
graphs.) Conjecture 9.1 was proved for t < 6 by Lescure and Meyniel (though 
they did not publish the proof for tf = 6), and more recently DeVos et al. [18] 
published a proof for t = 6. Both sets of authors used the same approach, proving 
the stronger statement that for ¢ < 6, every simple graph with minimum degree at 
least t immerses K;. For t > 9, it is not true that every graph with minimum degree at 
least t immerses K;; but DeVos et al. [17] proved the following (in fact they proved 
it for “strong” immersion, in which the vertices n(v) are not permitted to be internal 
vertices of the paths n(e)): 


Theorem 9.2. For all t => 0, every graph of minimum degree at least 200t 
immerses K;. 


Recently, Dvorak and Yepremyan [25] have improved this: 


Theorem 9.3. For all t > 0, every graph of minimum degree at least 11t + 7 
immerses K,. 


It follows that 


Theorem 9.4. Every graph that does not immerse K, is 11t + 7-colourable. 


10 Big Graphs 


The constructions of Kostochka and Fernandez de la Vega mentioned earlier show 
that there are graphs with no K, minor with average degree of the order of t(log t)!/”, 
and indeed their minimum degree and connectivity are also of this order. But there 
is a feeling that honest, sensible graphs with no K, minor are not really like this; 
they will have vertices of degree about t. How can we make this intuition closer to 
a true statement? 
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The intuition comes mostly from the Graph Minors structure theorem of 
Robertson and myself [68], which says very roughly that to make graphs with no 
K;41 minor, one takes graphs on surfaces of bounded genus and adds a bounded 
number of extra vertices; and if these extra vertices are not just attached to small 
parts of the surface, there had better not be many of them (or else we will get a K;+1 
minor); in fact at most ¢t — 4 of them, and fewer if the surface is not the plane. But 
vertices in the surface have average degree (in the surface) less than six, so total 
degree less than t + 2. There are have been several attempts to bring this very vague 
argument closer to reality, and in this section we discuss some of them. 

The feeling is that the examples of Kostochka and Fernandez de la Vega have 
only bounded size (which they do) in some essential way (which remains to be 
made precise). Of course we can make bigger examples by taking disjoint unions of 
the little ones, but then the connectivity is lost. What if we impose some connectivity 
restriction? Can there still be large examples? 

Thomas and I conjectured: 


Conjecture 10.1. For all t > 0 there exists N such that every (t — 2)-connected 
graph G with no K,; minor and with n = N vertices satisfies 


|E(G)| < (¢— 2)n— (¢-— D(t— 2)/2. 


This remains open. BoOhme et al. [8] proved: 


Theorem 10.2. For all positive integers t, there exists N such that every 3t + 2- 
connected graph with no K, minor and with at least N vertices has a vertex of degree 
less than 31(t + 1)/2 —3. 


This is very encouraging: everything is linear, the frightening (log t)!/? term has 
disappeared. 

How can we arrange some decent connectivity? To prove HC(f) it is enough 
to prove the impossibility of minimal or minimum counterexamples to HC(f) 
(a counterexample is “minimal” if no proper minor of itself is a counterexample; 
and “minimum” if no counterexample is smaller.) What about the connectivity of 
minimal counterexamples? Kawarabayashi [42] proved: 


Theorem 10.3. Fort > 0, every minimal counterexample to HC(t) is [2(t+1)/27]- 
connected, and every minimum counterexample to HC(t) is [(t + 1)/3]-connected. 


Mader [61] proved that for any value of t, if G is a minimal counterexample 
to HC(#) then G is 6-connected, and 7-connected if t > 6. When t = 5 this 
is particularly interesting, because it means that to prove HC(5) we only have to 
consider 6-connected graphs without Kg minors. And Jgrgensen [39] conjectured 
the following: 


Conjecture 10.4. Every 6-connected graph with no Ke minor is apex. 


(We recall that a graph is apex if it can be made planar by deleting one vertex, 
and in particular all apex graphs are 5-colourable.) Thus if only we could prove 
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Jgrgensen’s conjecture, we would obtain a much more appealing proof of HC(5). 
Unfortunately it remains open; but it might point a way to solve Hadwiger’s 
conjecture in general, if we could only figure out an analogue of this conjecture 
for larger values of t (and then figure out how to prove it). 

Kawarabayashi et al. [44] proved that Conjecture 10.4 itself is true in large 
graphs: 


Theorem 10.5. There exists N such that every 6-connected graph with at least N 
vertices and with no K6 minor is apex. 


More recently Norin and Thomas have announced the following analogue of 
Conjecture 10.5 for general values of ¢ (this is a difficult result with a huge proof, 
and is still being written at this time): 


Theorem 10.6. For all t => 0 there exists N such that every t + 1-connected graph 
with at least N vertices and with no K;4, minor can be made planar by deleting t—4 
vertices. 


So it would be nice to know that minimal counterexamples to HC(#) are t + 1- 
connected; but we do not know this. 

But recently it may have been shown that in fact there are no large minimal coun- 
terexamples to HC(f), using a feature of them slightly different from connectivity. 
A cutset of G means (in this paper) a partition (A, B,C) of V(G) with A,B # 9, 
such that there are no edges between A and B. A one-way clique cutset of G means 
a cutset (A, B, C), and for each v € C aconnected subgraph X, C BU C containing 
v, such that X,,, X, are disjoint and some edge has an end in X,, and an end in X,, 
for all distinct u, v € C. In other words, we can turn C into a clique by contracting 
edges within BU C. Suppose that G is a minimal counterexample to HC(#), for some 
value of t. Then it is easy to show that: 


* no vertex has degree at most f; 

* no vertex of degree t+ 1 has three nonadjacent neighbours; 
e there is no one-way clique cutset; and 

* Gcannot be made planar by deleting ¢ — 4 vertices. 


Robertson and I announced (about 1993) that we proved: 


Theorem 10.7. For all t => 0 and for every graph G with no K,4, minor, if G 
satisfies the four bullets above then G has bounded tree-width. 


This had all kinds of pleasing consequences, but the proof was very long, and was 
never written down, and now it is lost. Fortunately, almost the same thing, and with 
the same desirable consequences, has recently been announced by Kawarabayashi 
and Reed [46], and their proof seems more manageable, and may get written 
down. (At the moment the proof sketched in [46] has developed a few cracks, 
but Kawarabayashi maintains it can be fixed.) They added a fifth bullet to the four 
above: 
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e there do not exist a cutset (A,B,C) of G and disjoint connected subgraphs 
X,,...,X, of of GIA U C], each including a stable subset of C, such that if B’ 
denotes the graph obtained from B U X; U--- U X; by contracting the edges of 
X,,...,X;, then every t-colouring of B’ extends to one of G. 


This evidently also holds in any minimal counterexample to HC(t). They claim: 


Theorem 10.8. For all t => 0 and for every graph G with no K;41 minor, if G 
satisfies the five bullets above then G has bounded tree-width. 


This would have several consequences. The most important is probably an 
explicit function f(t) such that for all t, every minimal counterexample to HC(t) 
has at most f(t) vertices. 
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The Hadwiger—Nelson Problem 


Alexander Soifer 


Abstract Inspired by the Four-Color Conjecture, the Hadwiger—Nelson Problem 
became one of the famous open problems of mathematics in its own rights. It 
has withstood all assaults for 65 years, and attracted many mathematicians from 
many fields, including Paul Erdés and Ronald L. Graham. John F. Nash admired 
this problem and chose it for the present book. In this chapter we will discuss this 
problem, its history and generalizations, several of the many related open problems, 
and the state of the art results. ! 


1 The Problem 


The title problem asks to find the smallest number of colors sufficient for coloring 
the points of the Euclidean plane E? in such a way that no two points of the same 
color are unit distance apart. 

This number is called the chromatic number of the plane (CNP) and is denoted 


by x(E’). 


2 The History and the Authorship of the Problem 


While the distinguished Swiss geometer Hugo Hadwiger admired this problem 
and wrote about it in the early 1960s, the problem was posed earlier by one 
remarkable young person. As documented in [49], the problem was created in 
October-November 1950 by the 18-year old Edward Nelson (May 4, 1932, Decatur, 


‘Unlike the previous version of this survey [55], more history of the problem and its authorship 
has been included here. 
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GA-September 10, 2014, Princeton, NJ), who also determined a lower bound 4; his 
20-year old friend John Isbell found an upper bound 7: 4 < y(E*) <7. 

During his talk at 25th South Eastern International Conference on Combina- 
torics, Computing and Graph Theory in Boca Raton, Florida at 9:30-10:30 A.M. 
on March 10, 1994, Paul Erdés summarized the results of the author’s historical 
research on the authorship of this problem in a characteristically Erdésian style [17]: 


“There is a mathematician called Nelson who in 1950 when he was an epsilon, that is he 
was 18, discovered the following question. Suppose you join two points in the plane whose 
distance is 1. It is an infinite graph. What is chromatic number of this graph? 

Now, de Bruijn and I showed that if an infinite graph which is chromatic number k, it 
always has a finite subgraph, which is chromatic number k. So this problem is really [a] 
finite problem, not an infinite problem. And it was not difficult to prove that the chromatic 
number of the plane is between 4 and 7. I would bet it is bigger than 4, but I am not sure. 
And the problem is still open. 

If it would be my problem, I would certainly offer money for it. You know, I can’t offer 
money for every nice problem because I would go broke immediately. I was asked once 
what would happen if all your problems would be solved, could you pay? Perhaps not, 
but it doesn’t matter. What would happen to the strongest bank if all the people who have 
money there would ask for money back? Or what would happen to the strongest country if 
they suddenly ask for money? Even Japan or Switzerland would go broke. You see, Hungary 
would collapse instantly. Even the United States would go broke immediately .. . 

Actually it was often attributed to me, this problem. It is certain that I had nothing to do 
with the problem. I first learned the problem, the chromatic number of the plane, in 1958, 
in the winter, when I was visiting [Leo] Moser. He did not tell me from where this nor the 
other problems came from. It was also attributed to Hadwiger but Soifer’s careful research 
showed that the problem is really due to Nelson.” 


It is therefore fitting to call this problem The Nelson Problem or, as it is often 
called, The Chromatic Number of the Plane Problem (CNP). 

There are fascinating similarities between the famous Four-Color Problem (4CP) 
and the Nelson Problem (CNP). 4CP was created during 1850-1852 timeframe by 
the 18-20 year old Francis Guthrie. CNP was created a century later, in 1950 by the 
18-year old Edward Nelson. Augustus De Morgan was instrumental in keeping 4CP 
alive for decades. Paul Erdés, like De Morgan a century earlier, kept the flaming 
torch of the problem lit. He made the chromatic number of the plane problem well 
known by posing it in his countless problem talks and many publications. 

In fact, the starting point of Edward Nelson in creating CNP problem was 4CP. 
In his October 5, 1991, letter [33], he conveyed to the author the Story of Creation: 


“Dear Professor Soifer: 

In the autumn of 1950, I was a student at the University of Chicago and among 
other things was interested in the four-color problem, the problem of coloring graphs 
topologically embedded in the plane. These graphs are visualizable as nodes connected 
by wires. I asked myself whether a sufficiently rich class of such graphs might possibly 
be subgraphs of one big graph whose coloring could be established once and for all, for 
example, the graph of all points in the plane with the relation of being unit distance apart 
(so that the wires become rigid, straight, of the same length, but may cross). The idea did 
not hold up, but the other problem was interesting in its own right and I mentioned it to 
several people.” 
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Both problems, 4CP and CNP, required a very long time to be conquered. Victor 
Klee and Stan Wagon [54], observing that solving 4CP took 124 years, suggested 
that CNP might require as long for its solution: “If a solution of CNP takes as long 
as 4CC, then we will have a solution by the year 2084.” Will we succeed by 2084? 
Paul Erdés would have said, “We shall see!” The great composer Arnold Schoenberg 
believed that faith can move mountains. Erdés urged us to believe that the transfinite 
Book of all theorems and their best proofs exists. A similar belief led Appel and 
Haken to succeed in solving 4CP at the breaking point of available computing. 
Such a belief is needed to conquer CNP, the author’s favorite open problem of 
mathematics. 


3 Translation of the Nelson Problem into the Language 
of Graph Theory 


Graph Theory was born out of our neglecting geometric considerations of shape and 
size, and preserving only adjacency. Surprisingly, the past century has witnessed 
a renewed interest in graphs, where geometrical considerations such as distance 
matter, in fact, define the adjacency. 

We can create a graph G(E*) out of the Euclidean plane E” by taking all of 
its points as vertices, and joining two vertices by an edge if and only if they are 
at distance 1 apart. More generally, we call a graph unit-distance when any two 
vertices are adjacent if and only if they are at distance | apart. In this language, the 
Nelson Problem can be translated as follows: 


The Nelson Problem 1 Find the chromatic number of the graph G(E’). 


After 65 years of intensive work by many scholars, using tools from graph theory, 
geometry, abstract algebra, topology and measure theory, we have been unable to 
improve on the above bounds for y(E”) in the general case. Ronald L. Graham 
believes that the chromatic number of the plane is 5 or 6 (see [21, 22]). He cites 
a theorem of Paul O’Donnell (see [34, 49]) showing the existence of 4-chromatic 
unit-distance graphs of arbitrarily large girth (Theorem 28 below) as “perhaps, the 
evidence that x is at least 5.” Paul Erdés [14, 15] believed that the chromatic number 
of the plane was 5, 6, or 7. The author conjectured the answer to be 7. 


4 The Polychromatic Number: The Lower Bound 


When a problem withstands all assaults, mathematicians create related problems, 
solutions of which may shed light on the original problem. Let us look at one of 
them here. We say that a point set S realizes distance d if S contains two points at 
distance d apart. In 1958 Erdos posed the following question. 
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Fig. 1 The Mosers spindle A 


Polychromatic Number of the Plane Problem 2 What is the smallest number of 
colors needed to color the plane so that no color realizes all distances? 


In 1992 Soifer [43] named this invariant the polychromatic number of the plane, 
and denoted by Gh). Clearly Xp(E*) < (E*) <7. In 1970 upper and lower bounds 
for Xp(E’) were published by a Russian high-school student, Dmitry E. Raiskii [40]. 
We look at the lower bound here and return to the upper bound in Sect. 6. In 1961 
the brothers Leo and Willie Moser produced [31] a seven-point plane configuration 
that we now call The Mosers Spindle (Fig. 1): each edge in the spindle has length 1. 


Lemma 3 Any three points of the Mosers Spindle contain two points at distance 
1 apart. Consequently, in any coloring of a Mosers Spindle that forbids monochro- 
matic distance 1, at most two points can be of the same color. 


Raiskii [40] proved the following lower bound. 
Theorem 4 7,(E°)>4 


Proof The following striking proof was found in 1997 by Alexei Merkov, another 
Russian high-school student, and appeared in an obscure brochure [37]. It is 
presented here with modifications by the author. We assume that the plane is colored 
in three colors, red, white and blue, and that each color forbids a distance r, w and b, 
respectively. Equip the three-colored plane with Cartesian coordinates with origin 
O, and construct three 7-point sets S,, S,, and S,, each being the Mosers Spindle 
(Fig. 2), in such a way that all three spindles share O as one of their seven vertices 
and have edge lengths all equal to r, w and b, respectively. This construction defines 
18 vectors: six ‘red’ ones, v1, v2, ..., V6 from the origin O to each remaining point 
of S,, six “white’ ones, v7, vg, .. ., Vi2 from O to the points of S,,, and six ‘blue’ ones 
V13, Vi4,---, Vig from O to the points of Sp. 

We now introduce 18-dimensional Euclidean space E'® and a function M: E!® > 
E* that maps each vector (a1, d2,..., Gig) tO a)Vj + Gov2 +... + Aigvig. This 
function induces a three-coloring of E'® by assigning to a point of E'® the color of 
the corresponding point of the plane. In accordance with the colors of the vectors vj, 
we call the first six axes of E!® ‘red’, the next six axes ‘white’, and the last six axes 
‘blue.’ 
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Fig. 2. The Cartesian plane 
with three Mosers Spindles 


Let W be the subset of E'® consisting of all points whose coordinates include at 
most one equal to | for each of the three colors of the axes, and the remaining (15 
or more) coordinates equal to 0; it is easy to verify that W has 7° = 343 points. For 
any fixed array of coordinates allowable in W on white and blue axes, we get the 
7-element set A of points in W with these fixed coordinates on white and blue axes. 
The image M(A) forms in the plane a translate of the original 7-point set S,. If we 
fix another array of white and blue coordinates, we obtain another 7-element set in 
E'8, whose image under M in the plane would form another translate of S,. Thus, 
the set W gets partitioned into 49 subsets, each of which maps onto a translate of S,. 

By Lemma 3, any translate of S, has at most two red points among its 7 points. 
Since W has been partitioned into translates of S,, at most 7/7 of the points of W 
are red. We can now start all over again, and similarly show that at most 7/7 of the 
points of W are white, and at most 2/7 of the points of W are blue. But 2 fy 42 Iy 42 ly 
is less than 1. This contradiction implies that at least one of the colors realizes all 
distances, as required. 


5 The de Bruijn—Erdés Theorem 


We can expand the notion of the chromatic number to any subset S of the plane. 
The chromatic number y(S) is the smallest number of colors sufficient to color the 
points of S in such a way that forbids monochromatic pairs of points at distance 1 
apart. In other words, we create on S a unit distance graph, with vertex set S and 
two points adjacent if and only if they are distance | apart, and denote by y(S) the 
chromatic number of this graph. 
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In 1951 Nicolaas G. de Bruijn and Paul Erdés [11] published a lemma that 
implies the following important ‘compactness theorem’; its proof assumes the 
axiom of choice. 


The De Bruijn-Erdés Theorem 5 The chromatic number of the plane is equal to 
the maximum chromatic number of its finite subsets. 


Accordingly, Erdés used to say that the problem of finding the chromatic number 
of the plane is a problem about finite sets in the plane. In 1975 he posed the following 
problem [13]: 


Erdés’ $25 Problem 6 Let S be a subset of the plane which contains no equilateral 
triangle of side length 1. Join two points of S if their distance is 1. Does this graph 
have chromatic number at most 3? If the answer is no—assume that the graph 
defined by S contains no C; [cycles of length /] for 3 < / < t and ask the same 
question. 


Erd6s was unsure of the outcome (which was rare for him): he expected triangle- 
free unit distance graphs to have chromatic number at most 3, or else that chromatic 
number 3 can be forced by prohibiting all small cycles up to C;,, for sufficiently 
large t. 

In 1979 Nicholas Wormald [53] disproved the first, easier conjecture, contained 
in Problem 6, and Erdés promptly paid $25 and reported the result in a lecture and 
later in print [16]: 

In a recent paper (still unpublished), Wormald found a set S for which the unit distance 

graph G;(S) has girth 5 and chromatic number 4. His construction involved elaborate 

computations and is fairly complicated. Indeed, aided by a computer, he had proved in 


[49] the existence of a set S of 6448 points with chromatic number 4 and without triangles 
or quadrilaterals with all sides of length 1. 


In 1992 the author in a talk posed this Erdés’ $25 Problem, in a form of 
competition: 


Problem 7 Find the smallest number o4 of points in a plane set with chromatic 
number 4 without unit equilateral triangles, and classify all such sets S of o4 points. 


Several young mathematicians entered the race, and the graphs obtained by the 
record setters were as mathematically significant as they were beautiful (see [49]). 
The two record-holding graphs, created by roommates-graduate students of Rutgers 
University Robert Hochberg and Paul O’Donnell [24], are shown in Figs. 3 and 4; 
both graphs are 4-chromatic unit-distance graphs. 


6 The Polychromatic Number: The Upper Bound 


An example proving an upper bound Xp(E’) <6 was found by Sergei B. Stechkin 
(see Fig. 5) and published by Raiskii [40]. 

The ‘unit of construction’ is a parallelogram consisting of four regular hexagons 
and eight equilateral triangles, all of side-length 1. We color the hexagons with 
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Fig. 3. The 
Hochberg—O’ Donnell fish 
graph, of girth 4 and order 23 


Fig. 4 The 
Hochberg—O’ Donnell star 
graph, of girth 5 and order 45 


Fig. 5 Stechkin’s six-coloring of the plane 
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Fig. 6 Two squares that 
induce the tiling of the plane 
with octagons and squares 


Fig. 7 Boundary points 
included in the colors of the 
polygons are shown in bold 


Fig. 8 Soifer’s six-coloring 
of the plane 


colors 1, 2, 3 and 4. We then partition the triangles into two types: assign color 5 
to the triangles with a vertex below their horizontal base, and color 6 to those with 
a vertex above their horizontal base. While coloring, assume that every hexagon 
includes its entire boundary, except for its one right-most and two lowest vertices, 
and that every triangle includes none of its boundary points. We can now tile the 
entire plane with translates of the “unit of construction’. 

If our ultimate goal is to find the chromatic number of the plane, or at least to 
improve its known bounds, it might be worthwhile to ‘measure’ how close a given 
coloring of the plane is to achieving this goal. In 1992 such a measurement was 
introduced by the author and named the coloring type (see Soifer [43, 44]): given 
an n-coloring of the plane for which color i does not realize the distance d; (for 
1 <i<n), we say this coloring is of type (d1,do, .. ., dn). 

It would improve our search for the chromatic number of the plane if we could 
find a six-coloring of type (1, 1, 1, 1, 1, 1), or show that one does not exist. 
With an appropriate choice of unit, Stechkin’s coloring in Fig. 5 has type (1, 1, 
1, 1, '/, '/). In 1973 Woodall [52] found a second six-coloring of the plane 
with no color realizing all distances; his coloring had a property that each of 
the six monochromatic sets was closed. However, his example had three ‘missing 
distances’: it had type (1, 1, 1, 1/3, 1/3, 1/23). In 1991 a new six-coloring 
was found by the author [44], using a tiling of the plane by squares and non-regular 
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octagons; it had type (1,1,1,1,1, 1/4/5). To construct it, we start with two squares, 
one of side 2 and the other of diagonal | (see Fig. 6), and use them to create a 
tiling of the plane with squares and non-regular octagons (see Fig. 8); colors 1-5 
were used for the octagons, and all the squares were colored 6. With each octagon 
and each square we include half of its boundary (bold lines in Fig. 7) without the 
endpoints of that half. It is easy to verify that \/5 is not realized by any of the colors 
1-5, and 1 is not realized by color 6. By shrinking all linear sizes by a factor of /5, 
we obtain the six-coloring of type (1,1,1,1,1, 1/\/5). To simplify the verification, we 
define the unit of construction as the region bounded by the bold line in Fig. 8; its 
translates tile the plane. 

The above six-coloring gave birth to a new definition (see [25]). The almost 
chromatic number x,(E*) of the plane is the minimal number of colors that are 
required to color the plane so that all but one of the colors forbid unit distance, 
and the remaining color forbids a distance, which is not necessarily unit. Note that 
4< XalE’) <6; the lower bound follows from Raiskii [40], and the upper bound 
follows form the above six-coloring. The problem of determining 7,(E’) is still 
open (see [49]). 


7 The Continuum of Six-Colorings 


In 1993 another six-coloring was found by the 15-year old violinist Ilya Hoffman 
and the author (see [25, 26]), with type (1, 1, 1, 1, 1, f= 1); the story of its 
discovery can be found in [49]. To construct it we first tile the plane with squares 
of diagonals 1 and ./2 —1 (see Fig. 9). We use colors 1-5 for larger squares and 
color 6 for all small squares. With each square we include the left and lower sides 
of its boundary without the endpoints of this half (see Fig. 10). To verify that this 
coloring does the job, define the unit of construction that is bounded by the bold line 
in Fig. 9; its translates tile the plane. 

In 1993 the above two examples prompted the introduction of new terminology 
and the translation of earlier results and problems into this new language (see [45, 
46]). We let X¢ denote the 6-realizable set of all positive numbers a for which there 
exists a six-coloring of the plane of type (1, 1, 1, 1, 1, w). The problem, posed by 
the author, which is still open and extremely difficult, is to find X¢. 

We know, from our above discussion, that 1/ J/5 and /2-1 both lie in Xo. AS 
shown by the author in [45, 46] (see also [49]), these are extreme examples of the 
general case, which includes a continuum of ‘working’ six-colorings: for every a 
between /2-1 and VJ/5, there is a six-coloring of type (1, 1, 1, 1, 1, @). 


Theorem 8 X¢ contains the entire closed interval [/2 -1,1/ /5]. 


Outline of Proof We tile the plane with congruent non-regular octagons and ‘small’ 
squares (see Fig. 11). We now color this tiling in six colors. Denote by F the unit 
of our construction, bounded by a bold line and consisting of five octagons and four 
‘small’ squares. Use colors 1-5 for the octagons inside F and color 6 for all ‘small’ 
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squares, and include in the colors of octagons and ‘small’ squares those parts of 
their boundaries that are shown in bold in Fig. 12. This is followed by a proof that 
for each a from the given in the theorem interval, we can vary the relative sizes of 
the hexagons and squares and the angle between them to guarantee that this six- 
coloring has type (1, 1, 1, 1, 1, w). We have found a continuum of values for a and 
a continuum of ‘working’ six-colorings of the plane. 


8 The Chromatic Number of the Plane in Special 
Circumstances 


In 1973 Douglas R. Woodall [52] attempted to prove a lower bound for the 
chromatic number of the plane for the special case of ‘map-type colorings’. 
However, in 1979 Stephen P. Townsend constructed a counter-example that showed 
that one essential idea of Woodall’s proof was incorrect. By that time, Townsend 
had already proved the same result, and his proof was much more elaborate 
than Woodall’s attempt. For decades Townsend’s proof was unavailable until he 
produced a clear version of it where the definition of the map-type coloring can 
also be found (see [49]). The following result was thus conjectured by Woodall and 
proven by Townsend. 


The Townsend—Woodall Theorem 9 The chromatic number of the plane under 
map-type coloring is 6 or 7. 


Woodall [52] showed that this result implies another result worth mentioning. 


Fig. 9 Hoffman and Soifer’s 
six-coloring of the plane 
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Fig. 10 Boundary points 
included in the colors of the 
squares are shown in bold 


Fig. 11 Continuum of 
six-colorings of the plane 


Fig. 12 Boundary points 
included in the colors of the 


polygons are shown in bold ry 


Theorem 10 The chromatic number of the plane under coloring with closed 
monochromatic sets is 6 or 7. 


In 1993-94 three American undergraduate students, Nathanial Brown, Nathan 
Dunfield and Greg Perry, proved that a similar result is true for coloring with open 
monochromatic sets (see [3—5]). 


Theorem 11 The chromatic number of the plane under coloring with open 
monochromatic sets is 6 or 7. 


A related graph-theoretic result was obtained by Carsten Thomassen ([49, 51]). 
He proved that for a “nice” coloring of a connected graph on a surface, such as the 
plane, seven colors are needed. 
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Meanwhile, Kenneth J. Falconer [18], while still a graduate student, proved the 
following important result, that the measurable chromatic number xm(E*) of the 
plane is 5, 6 or 7. 


4 
The Falconer Theorem 12 If E? = UAi is a covering of the Euclidean plane E” 


by four disjoint measurable sets, then one of the sets A; realizes distance 1. 


Decades later, Falconer wrote a much more detailed and self-contained exposi- 
tion of his result especially for [49]. Theorem 12 means that if a four-coloring of the 
plane forbids monochromatic distance 1, then one of the classes is non-measurable. 


9 Space Explorations 


Around 1961 Erdos generalized the problem of finding the chromatic number of the 
plane to n-dimensional Euclidean space E”. He was interested both in asymptotic 
behaviour as n increases, and in exact values of the chromatic number x(E”) for 
small n (especially n = 2 and 3). 

In 1970 Raiskii [40] proved that y(E") > n+ 2, for all n>1; thus, for n= 3 we 
have y(E?) > 5. This lower bound for E? lasted until 2000, when Oren Nechushtan 
proved that y(E?) > 6 (see [32]). For upper bounds, David Coulson [9, 10] proved 
that y(E°) < 15, by using a face-centred cubic lattice (see Conway and Sloane [8] for 
more on cubic lattices). For higher dimensions, Kent Cantwell [6] proved in 1996 
that y(E* )>7 and y(E°) > 9; these remain the best results known. Then in 2008 the 
Czech student Josef Cibulka [7] proved that Y(E°) >11. 

Many years ago, Erdés conjectured that the chromatic number x(E”) increases 
exponentially with n. This conjecture was settled in the affirmative by two results, 
an exponential upper bound, found in 1972 by D. G. Larman and C. A. Rogers [28], 
and an exponential lower bound obtained in 1981 by P. Frankl and R. M. Wilson 
[20]: 


For alln, (1 + o(1)) 1.2" < y(E") < (34+ 0(1))". 


Asymptotically, Larman and Rogers’s upper bound remains the best known 
today. In 2000 Frankl and Wilson’s asymptotic lower bound was improved by 
Andrei Raigorodskii [39] to (1.239... + 0(1))”. Narrowing the remaining gap is 
desirable. 

The polychromatic number Xp(E*) also generalizes to higher dimensions. Raiskii 
[40] proved that y,(E") = n + 2, for all n > 1. Larman and Rogers’s upper bound 
implied y,(E") < (3 + o(1))”. Their conjecture that y,(E") grows exponentially in 
n, was proved by Frankl and Wilson [20], who showed (1 + o0(1)) 1.2” < yp(E"). 


The Hadwiger—Nelson Problem 451 
10 The Chromatic Number of Rational Spaces 


Another approach to the chromatic number of the plane E” is to use Cartesian 
coordinates. Here, E” is the set of all ordered pairs (x, y) with real coordinates x and 
y, with the distance between two points defined in Euclidean way. Since it suffices to 
deal with finite subsets of E?, by de Bruijn and Erdés’ Theorem 3.1, we can restrict 
the coordinates to some subset C of E. The problem is: which subset should we 


choose? 


Problem 13 Find a countable subset C of the set of real numbers E whose 
chromatic number x(C?) equals that of the plane. 


The set Q of all rational numbers does not work, as shown by Woodall [52]: 
Theorem 14 7(Q7) = 2. 


In 1975 there appeared ‘the legendary unpublished manuscript’, as P. D. Johnson, 
Jr. referred to the manuscript by Miro Benda and Micha Perles. This widely 
circulated manuscript was called Colorings of Metric Spaces; Johnson tells its story 
in the Geombinatorics, where in January 2000 the Benda—Perles paper was finally 
published [2]. It included the following results: 


Theorem 15 y(Q*) = 2 and y(Q*) = 4. 
Benda and Perles [3] then posed some important open problems. 


Problem 16 Find y(Q°) and, in general, y(Q”). 


2 
Problem 17 Find the chromatic number of o( V2) and, in general, of any 


algebraic extension of Q?. 


This direction was developed by P. D. Johnson, Jr., Joseph Zaks, Klaus Fischer, 
Kiran B. Chilakamarri, Michael Reid, Douglas Jungreis, David Witte, and Timothy 
Chow. In 2006 Johnson [29] published in Geombinatorics “A tentative history and 
compendium” of this direction of inquiry. 

More recently Matthias Mann [29] has proved that y(Q>) > 7. This jump from 
x(Q*) = 4 explains the difficulty of finding y(Q°), whose exact value is still 
unknown. Mann [30] then obtained further lower bounds: x(O°)>10; x(O")=13; 
x(O8)> 16. In 2008 Cibulka [7] obtained new lower bounds for these chromatic 
numbers, improving some of Mann’s results: y(Q>)>8 and 7(Q’)>15 


11 One Odd Graph 


In 1994 Moshe Rosenfeld [41] defined the odd-distance graph Ey aa to be the graph 
with vertex-set E” in which two vertices are adjacent whenever the distance between 
them is an odd integer. He showed that Eoaq does not have a subgraph K4, and 
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asked whether the chromatic number of Eoaq is finite. In fact, while the problem was 
new, the absence of K4-subgraphs was not, following from a more general result of 
Graham, Rothschild and Straus [23]: 


Theorem 18 Jn E” there exist n+ 2 points for which the distance between any two 
of them is an odd integer if and only if n= 14 (mod 16). 


In the necessary part of the proof, the authors used a Victorian result about 
determinants by Arthur Cayley. The main problem remains wide open: 


Problem 19 Find y(Eoaa). 


We do not even know whether y(Eoaa) is finite. In 2009 Ardal, Manuch, 
Rosenfeld, Shelah and Stacho [1] improved the lower bound to y (Eoua) > 5. Let 
me pose here, for the first time, the following conjecture: 


Conjecture 20 y(Eoaa) = Xo. 


In fact, let us denote the measurable chromatic number of the odd-distance graph 
by Ym(Eoaa). In 1986 Falconer and Marstrand [19] proved that plane sets with 
positive density at infinity contain all large distances. This proves the conjecture 
in a special case: 7(Eoaa) = Xo- 


12 The Influence of Set Theory Axioms 
on Combinatorial Results 


Here we need some ideas from set theory. The standard Zermelo—Fraenkel—Choice 
system of axioms for set theory will be denoted by ZFC; the countable axiom of 
choice by ACy,, the principle of dependent choices by DC. We will use one further 
axiom, LM: every set of real numbers is Lebesgue measurable. Our first task is to 
extend the definition of the chromatic number of a graph. Without the axiom of 
choice, the chromatic number of a graph may not exist. In allowing a system of 
axioms for set theory to exclude the axiom of choice, we need to create a much 
broader definition of the chromatic number than the usual one, if we want it to exist. 
In fact, instead of the chromatic number we ought to talk about the set of chromatic 
cardinalities. There are several meaningful ways to define this. Here is one that the 
author chose in [49]. 


Definition 21 Let G be a graph and let A be a system of axioms for set theory. 
The set of chromatic cardinalities y4(G) of G is the set of all cardinal numbers 
t < |V(G)| for which there is a proper coloring of the vertices v(G) of G in t 
colors, and t is minimum with respect to this property. 


As can be seen, the set of chromatic cardinalities needs not have just one element 
as with A= ZFC. It can also be empty. The advantage of this definition is its 
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simplicity. Best of all, we can use inequalities on sets of chromatic cardinalities 
as follows. 

Let t be a cardinal number. The inequality y4(G) > t means that, for every o € 
x4(G), o > T; the inequalities <, < and > are defined analogously. We also agree 
that the empty set is greater than or equal to any other set of cardinal numbers. 
Finally, if t is a cardinal number and y4(G) = {r} is a one-element set of chromatic 
cardinalities (as is the case with the chromatic number when A = ZFC), then we 
will simplify our notation by omitting parentheses and writing y4(G) = t. 

An infinite cardinal &, is regular if cf @ = @q, and x is a Strong limit cardinal 
if, for every cardinal A, A <« implies that 2+ <«. A cardinal x is called inaccessible 
if k > No, « is regular, and « is a strong limit cardinal. Assuming the existence 
of an inaccessible cardinal, and using Paul Cohen’s forcing, Robert Solovay [50] 
constructed in 1964 and published in 1970 a model that proved a remarkable 
theorem. In his honor the author introduced the following definitions [49]. 

The Zermelo—Fraenkel-Solovay system of axioms ZFS for set theory is defined 
by ZFS = ZF + ACy, + LM, and ZFS-+ stands for ZF + DC + LM. 

Solovay’s theorem can now be formulated very concisely: 


Solovay’s Theorem 22 ZFS-+ is consistent. 


Saharon Shelah and Alexander Soifer [42] constructed the following example. 
Define a graph G as follows: the vertex-set is the set of real numbers, and the set 
of edges is {(s, t): s — t— /2 € Q}. They proved that for this graph y7*°(G) = 2, 
while 7x2*(G) > Xo. 

This example and its analogues in 2- and more generally n-dimensional 
Euclidean spaces prompted Jean-Paul Delahaye [12] to point out philosophical 
questions of the foundations stemming from these examples: 


“Tt turns out that knowing if the world of sets satisfies the axiom of choice or a competing 
axiom is a determining factor in the solution of problems that no one had imagined depended 
on them. The questions raised by the new results are tied to the fundamental nature of the 
world of sets. Is it reasonable to believe that the mathematical world of sets is real? If it 
exists, does the true world of sets—the one in which we think we live—allows the coloring 
of S. Shelah and A. Soifer in two colors or does it require an infinity of colors? ... 

A series of results concerning the theory of graphs, published in 2003 and 2004 
by Alexander Soifer of Princeton University and Saharon Shelah, of the University of 
Jerusalem, should temper our attitude and invite us to greater curiosity for the alternatives 
offered by the axiom of choice. The observation demonstrated by A. Soifer and S. Shelah 
should force mathematicians to reflect on the problems of foundations: what axioms must 
be retained to form the basis of mathematics for physicists and for mathematicians?” 


Delahaye observes (ibid): “These [Shelah-Soifer’s] results mean, as with the par- 
allels postulate, that several different universes can be considered,” and continues: 


“In the case of geometry, the independence of the parallels postulate proved that non- 
Euclidian geometries deserved to be studied and that they could even be used in physics: 
Albert Einstein took advantage of these when, between 1907 and 1915, he worked out his 
general theory of relativity. 

Regarding the axiom of choice, a similar logical conclusion was warranted; the universes 
where the axiom of choice is not satisfied must be explored and could be useful in physics.” 
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Similar examples with the plane E*, and in general E”, as the vertex-set, were 
constructed in [48] and [47] respectively. These examples illuminate the influence 
of the system of axioms for set theory on combinatorial results. They also suggest 
that the chromatic number of E” may not exist ‘in the absolute’ (that is, in ZF), but 
may depend upon the system of axioms chosen for set theory. An important example 
later came from the Australian student Michael Payne [35], who started with the 
unit-distance graph G; whose vertex-set is the rational plane Q?, where, as usual, 
two vertices are adjacent if and only if they are at distance 1 apart. He then showed 
that the desired unit-distance graph G on the vertex-set E” is obtained by tiling the 
plane by translates of the graph G — that is, its edge-set is {(p1, p2): pi, p2 € E’, pi — 
p2 € Q*, and |p; — p2| = 1}, and proved that y7*°(G) = 2 and 3 < y7*S(G) < 7. 
Payne proved first that any measurable set S of positive Lebesgue measure contains 
the endpoints of a path of length 3 in G. Of course, this rules out a 2-coloring of S. 
Payne continued: “We can then proceed in a similar fashion to Shelah and Soifer’s 
proof [42].” In 2009 Payne [36] constructed a new class of unit distance graphs on 
the vertex-set E” whose chromatic number depends upon the system of axioms for 
set theory. 


13 Predicting the Future 


In 2003 Shelah and Soifer [42] observed the following surprising result. 


Theorem 23 Assume that any finite unit-distance plane graph has chromatic 
number not exceeding 4. Then y2¥© (E’) = 4, but 7285+ (E’) > 5. 


Can we obtain any results unconditionally? Yes, we can [49], but not yet in ZFC. 
Theorem 24 72"S+ (E7) > 5. 


We conclude with the author’s conjectures [49] of the expected value of the 
chromatic number of the plane, and more generally of E"—in ZFC. Formulating 
conjectures is a form of predicting the future, is it not? 


Conjecture 25 x (E”) = 4 or 7. 


If the chromatic number of the plane were 4, then Theorem 23 would imply that 
the chromatic number of the plane does not exist in absolute, but depends upon the 
choice of the system of axioms for set theory. However, if the author were limited 
to a single value, he would conjecture the value 7: 


The Chromatic Number of the Plane Conjecture 26 x (E”) = 7. 


If this conjecture is true, then a unit-distance 7-chromatic finite graph exists in 
the plane. In 1998 Dan Pritikin [38] published a lower bound for the order of such a 
graph: 


Theorem 27 Any unit-distance 7-chromatic graph G has at least 6198 vertices. 


In fact, the order of the smallest such graph may be even larger. 
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The following important result from the 1999 Ph.D. thesis of Paul O’ Donnell 
makes many of us think that the chromatic number of the place is greater than 4: 


The O’Donnell Theorem 28 For any n > 3, there is a unit distance 4-chromatic 
graph of girth n. 


For the 3-dimensional space the author [49] conjectures as follows: 
Conjecture 29 y (E*) = 15. 
In general, the author conjectures [49]: 


The Main Chromatic Number Conjecture 30 For any positive integer n > 1, 


Xx (E") = gntl_ 1. 


To paraphrase Paul Erdés’ words about some of his conjectures, we can say, the 


Main Conjecture will likely withstand centuries, but we shall see! 


The author is forever indebted to the late Paul Erdés and Edward Nelson for 


their inspiration and friendship. A heartfelt gratitude goes to John F. Nash, Jr. and 
Michael Rassias for their appreciation of this problem and a kind invitation to write 
this chapter in a group of distinguished colleagues for this spectacular collection of 
open problems of mathematics. 
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Erd6s’s Unit Distance Problem 


Endre Szemerédi 


Abstract We survey some problems and results around one of Paul Erd6s’s favorite 
questions, first published 70 years ago: What is the maximum number of times that 
the unit distance can occur among 7 points in the plane? This simple and beautiful 
question has generated a lot of important research in discrete geometry, in extremal 
combinatorics, in additive number theory, in Fourier analysis, in algebra, and in 
other fields, but we still do not seem to be close to a satisfactory answer. 


1 A Short History of the Problem 


Diameters Consider all (5) pairs taken from an n-element point set P in the plane. 
In 1934, Hopf and Pannwitz [33] discovered that the number of pairs that determine 
the largest distance, that is, the diameter of P is at most n. This bound is attained 
by the vertex set of the regular n-gon and by many other configurations. Shortly 
after, Erdés generalized the question in several different ways. In particular, he 
asked that at most how many times the diameter can occur among 7 points in 
3-dimensional space and in higher dimensions. His childhood friend, Andy 
Vazsonyi conjectured that in 3-space the answer was 2n — 3, for every n > 4. This 
was proved independently by Griinbaum [28], Heppes [32], and Straszewicz [60]. 
The bound is sharp for the set of vertices of a tetrahedron with n — 4 additional 
points that lie on a circle C passing through two vertices of the tetrahedron, such 
that the perpendicular axis of C is the line induced by the two other vertices. 
This result easily implies that Borsuk’s famous conjecture [6] is true in 3-space: 
every d-dimensional point set can be decomposed into d + 1 pieces of smaller 
diameter. Surprisingly, in 1993, Kahn and Kalai [36] disproved Borsuk’s conjecture 
when d is sufficiently large. At present, the smallest known counterexample is 
64-dimensional [34]. It is an interesting question to decide what happens in lower 
dimensions. 
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The diameter graph of a point set P is the graph on the vertex set P, in which two 
vertices are connected by an edge if and only if their distance is the diameter of P. 
A beautiful generalization of the above mentioned result of Hopf and Pannwitz was 
suggested by Schur [53]. He conjectured that the maximum number of d-cliques 
(complete subgraphs with d vertices) in the diameter graph of a d-dimensional point 
set is n. The answer does not depend on the dimension! Schur et al. [53] proved 
this conjecture for d = 3. Morié and Pach [48] verified it in the case when any two 
d-cliques of the diameter graph share d — 2 vertices, and conjectured this condition 
always holds. Finally, Kupavskii and Polyansky [42, 43] proved this last conjecture 
and, hence, settled Schur’s conjecture in the affirmative. 


Geometric Graphs Erdés made another fruitful observation concerning the Hopf- 
Pannwitz result on diameters. He noticed that if we draw the edges of the diameter 
graph as straight-line segments, any two edges share an endpoint or an interior point. 
We call a graph whose vertices are points in the plane, no 3 on a line, and whose 
edges are closed segments, a geometric graph. Erdés pointed out that essentially 
the original proof of the Hopf-Pannwitz result also gives that every geometric 
graph with n vertices and no 2 disjoint edges has at most n edges. Following 
this observation, Erdés, Avital and Hanani [3], Kupitz [44], and later Perles and 
Pach [49], started to systematically explore extremal problems for geometric graphs, 
where the forbidden configuration is “geometrically defined”. What is the maximum 
number of edges that a geometric graph of n vertices can have if it contains no 
k disjoint edges, no k pairwise intersecting edges, no non-selfintersecting path of 
length k, etc.? This has led to the birth of a rich separate area within combinatorial 
geometry, which is usually called geometric graph theory. See [50], for a recent 
survey. 


Unit Distances In 1946, Erdés [20] published a short paper in the American 
Mathematical Monthly, in which he suggested a very natural modification of the 
Hopf-Pannwitz question. Let P be a set of n points in the plane. What happens if we 
want to determine or estimate u(n), the largest number of unordered pairs {p, g} C P 
such that p and gq are at a fixed distance, which is not necessarily the largest distance 
between two elements of P? Without loss of generality we can assume that this 
distance is the unit distance. This explains why Erdés’s question is usually referred 
to as the unit distance problem. That is, 


u(n) = max |{{p,q} CP: |p—q| = 1}. 
PCR?2,|P|=n 


Using classical results of Fermat and Lagrange, Erdés showed that one can 
choose an integer x < n/10 that can be written as the sum of two squares in at least 
n‘/‘ozloe” different ways, for a suitable constant c > 0. Thus, among the points of 
the ./nx ./n integer lattice, there are at least (1/2)n!*¢/!°8!8” pairs whose distance 
is ./x. Scaling this point set by a factor of 1/./x, we obtain a set of n points with at 
least (1/2)n!*¢/'8!8”, i.¢., with a superlinear number of, unit distance pairs. 
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Fig. 1 If the segments Q 

PA, QA, PB, QB, QA’ are of 

length 1, then the length of 

PA’ cannot be | B A 


A 


For any set of n points P, define the unit distance graph of P as the geometric 
graph Gp on the vertex set V(Gp) = P, in which two vertices are joined by a 
segment if and only if their distance is 1. We want to estimate the maximum number 
of edges that such a graph can have. Erdés noticed that Gp does not contain K2 3, 
a complete bipartite subgraph with 2 and three vertices in its classes (see Fig. 1). 
He proved that the maximum number of edges of a K2,3-free graph on n vertices is 
con*/*, for some cr > 0; see [19]. This statement and Turdn’s theorem [68] were the 
first instances of a result that belongs to the—by now vast—field called (Turan-type) 
extremal graph theory; see Bollobas [5]. Summarizing, Erdés proved that 


yi ter/loglogn < u(n) ce on!?, 


for some c),c2 > 0, and he conjectured that the order of magnitude of u(n) is 
roughly n'*¢/ les!" Tn spite of many efforts to improve on the upper bound, 70 
years after the publication of the paper in the Monthly, the best known upper bound 
is still only slightly better than the above estimate. Erdés’s upper bound was first 
improved by Jézsa and Szemerédi [35] to o(n*/*), and 10 years later by Beck and 
Spencer [4] to O(n!9/°). In a joint paper with Spencer and Trotter [59], I proved 
u(n) = O(n*/3), which is the best currently known result. 

Erdés’s approach has had a tremendous impact on combinatorial geometry. 
Today, when we try to solve a geometric optimization problem, first we attempt to 
extract some special combinatorial property of the underlying structure. If we can 
associate a graph or a hypergraph H with the geometric situation, we check if we can 
exclude a so-called forbidden configuration (subgraph or subhypergraph) Hp. If this 
is the case, we forget about geometry, and try to establish or apply some known 
results to bound the number of (hyper)edges of an Ho-free graph or hypergraph. 
This point of view has fertilized extremal combinatorics and motivated many new 
problems and results. 


Incidences Given acurve y and a point p, we say that there is an incidence between 
them if y passes through p, that is, p € y. Up toa factor of at most 2, the unit distance 
problem can now be rephrased as follows. Given a set P of n points in the plane 
and a set I of n unit circles, what is the maximum number of incidences between 
them? Of course, the same question can be asked for any set of n curves taken 
from a fixed family of curves other than the family of unit circles. An important 
special case is when I consists of n distinct straight lines. For this case, Erdés 
conjectured and I proved together with Trotter [64, 65] that the maximum number of 
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incidences is O(n*/*). Note that in a finite projective plane the number of incidences 
between n points and n lines can be as large as cn*/?. Thus, our result establishes a 
combinatorial distinction between the Euclidean plane and finite projective planes. 
The proof technique of the original paper was extended by Téth [67], who generalize 
the Szemerédi-Trotter theorem to the complex plane. Later a slightly weaker, but 
more general result was proved by Solymosi and Tao [57], using the polynomial 
ham-sandwich theorem of Guth and Katz [29, 30] and methods from algebraic 
geometry. 

Our bounds on the number of incidences between points and unit circles, and 
points and lines in the plane have a common generalization. A family of curves has 
k degrees of freedom and multiplicity type s if 


1. for any k points there are at most s curves of I’ passing through all of them, and 
2. any pair of curves from [ intersect in at most s points. 


For example, the family of all straight lines in the plane has 2 degrees of freedom 
and multiplicity type 1. The family of all unit circles has 2 degrees of freedom 
and multiplicity type 2. The family of all circles of arbitrary radii has 3 degrees of 
freedom and multiplicity type 2. The family of all polynomial curves defined by an 
equation y = p(x), where p is a polynomial of degree d, has d+1 degrees of freedom 
and multiplicity type d. According to the Pach-Sharir theorem [51], the number of 
incidences between n points in the plane and m curves taken from a family T° of 
curves with k degrees of freedom and multiplicity type s is at most 


clk, s) (nk/ RD py(2k-2)/2K—D wae m) 


for a constant c(k, s) depending only on k and s. In the special case m = n,k = 2, 
we obtain the bound u(n) = O(n*/?) on the maximum number of unit distances 
among n points in the plane and the Szemerédi-Trotter theorem on incidences 
between points and lines. 


Computational vs. Combinatorial Geometry In the late 1980s, it became clear 
that results about incidences play an important role in analyzing many basic 
algorithms in computational geometry, computer vision, and robotics. Clarkson, 
Edelsbrunner, Guibas, Sharir, Welzl, and other computer scientists revisited these 
questions, and made many important contributions. In particular, they gave a new 
proof of the Szemerédi-Trotter theorem based on space partitions constructed by 
random sampling methods [16]. They also generalized the incidence results in a 
natural way, motivated by applications in motion planning. A set of m lines divide 
the plane into at most ‘ee + 1 cells. We pick n of them, and we would like to 
bound from above the total number of their sides. It was shown in [16] that this 
quantity is at most O(n7/3m?/? + n + m). By perturbing a little the arrangement of 
lines, so that a point incident to k lines becomes a cell with O(k) sides, we obtain 
that the number of incidences between m lines and n points is O(n2/?m?/3 +n +m). 
Similar generalizations have been established for the total number of sides of n 
cells in arrangements of various kinds of curves. Combinatorial and computational 
geometry have both benefitted a lot from decades of interaction. 
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Distinct Distances In the same 1946 paper published in the American Mathe- 
matical Monthly where Erd6s raised the unit distance problem, he also asked the 
following closely related question. What is the minimum number f(n) of distinct 
distances determined by a set P of n points in the plane? In other words, if we list 
with multiplicities all (5) distances between the points of P, at least how many of 
these numbers are necessarily distinct? Erdés conjectured that f(n) > cn/./logn 
for a suitable constant c > 0. Again, this would be asymptotically tight for an 
integer lattice of size ./n x ./n. Obviously, if we have an upper bound on u(n), the 
maximum number of times the same distance can occur among n points, this yields a 
lower bound on f(n), the minimum number of distinct distances. This follows from 
the trivial inequality 


() 
u(r)” 


Therefore, the best known bound u(n) = O(n‘/>) only implies that f(n) > cn?/?. 
This result was subsequently improved in [14, 15, 39, 58, 63], culminating in the 
result of Katz and Tardos [40], which states that for any ¢ > 0, any set of n > 
no(€) points in the plane has an element from which there are at least n*~* distinct 
distances, where w = (48—14e)/(55—16e) ~ 0.8641. Thus, we have f(n) > n°8%, 
provided that n is sufficiently large. These developments suggested that it might 
be easier to settle the problem of distinct distances than the unit distance problem. 
Indeed, in a sensational breakthrough, Guth and Katz [30] almost completely settled 
Erdés’s question. They proved that f(n) > cn/ log n. Curiously, no similar bound is 
known for the stronger version, where we want to find a point from which there are 
many distinct distances. 


f(n) = 


2 A Short Proof of the Best Upper Bound on u(r) 


In the original paper [59], in which we proved the best known upper bound, 
u(n) = O(n*/3), for the unit distance problem, we modified the ideas in [65]. The 
proof had two main components: a regular subdivision of the plane and a counting 
argument. Combining them, we found that if the number of unit distances were too 
large, the total number of intersection points between the unit circles drawn around 
the n points would exceed 20). which is impossible. The aim of this section is to 
present a much more elegant argument, due to Székely [63], which is based on the 
Crossing Lemma of Ajtai et al. [1] and, independently, Leighton [45]. In retrospect, 
the idea of this proof does not differ much from the original one, but there is no need 
to construct a regular subdivision of the plane: it will be automatically created by a 
random “thinning” of the unit distance graph; see below. As was mentioned earlier, 
there is a third proof [16], in which the regular subdivision itself is constructed 
by random sampling. The same proof is “redressed” by Kaplan et al. [38]. Here the 
only difference is that the partitioning is created using the polynomial ham-sandwich 
theorem of Guth and Katz. 


464 E. Szemerédi 


We start with presenting the Crossing Lemma. We need to extend the notion 
of geometric graphs. A topological graph is a graph drawn in the plane so that 
its vertices are represented by points and its edges are represented by Jordan arcs 
connecting the corresponding point pairs, where 


1. no arc passes through any point representing a vertex, other than its endpoints; 
2. any two arcs intersect in a finite number of points; 
3. no two arcs are tangent to each other. 


If two arcs have an interior point in common, they must properly cross at this point. 
We refer to such a point, as a crossing, and we say that the two edges (arcs) cross 
each other. For simplicity, in terminology, we make no distinction between vertices 
and the corresponding points and between edges and the corresponding arcs. 


Crossing Lemma ([1, 45]). Let G be a topological graph with n vertices and 
m > 4n edges. Then cr(G), the number of crossing pairs of edges, satisfies the 
inequality 


m 


1 
er 
onG) 2 5 =e 


Proof. The following proof is folkloristic: it was discovered by Lovasz, MatouSek, 
Pach, and others. It follows by induction on m that 


cr(G) > m—3n+6>m-— 3n. (1) 

For any 0 < p < 1, define a random (topological) subgraph G, of G, by selecting 
each vertex of G independently with probability p, and letting G, denote the 
subgraph induced by the selected vertices. That is, an edge (arc) of G belongs to G, 
if and only if both of its endpoints have been selected. Let n(G,) and m(G,) denote 


the number of vertices and the number of edges in G,, respectively. According to (1), 
we have 


cr(G,) = m(G,) — 3n(G,). 
Taking the expected values of both sides, we obtain 
E[cr(G,)] = E[m(G,)] — 3E[n(G,)]. 
Clearly, we have E[n(G,)] = pn and E[m(G,)] = p’m. On the other hand, the 
expected number of crossing pairs of edges not incident to the same vertex of G, is 


equal to p*cr(G). Consequently, 


p'cr(G) = p’m — 3pn. 
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Setting p = 4n/m, we obtain that 


3n nm 3m? 1 mm 


yp lon? 64n2 64. 2” 


cr(G) = . _ 
p 


which completes the proof. Oo 


As before, let u(n) denote the maximum number of times that the unit distance 
can occur among n points in the plane. The desired inequality u(n) < cn*/?, for a 
suitable positive constant c, is an immediate corollary of the m = n special case of 
the following theorem. 


Theorem ([16]). There is an absolute constant c > 0 such that the number of 
incidences between n points and m unit circles in the plane is at most c(n??m?/? + 
n+m). 


Proof. The following argument is due to Székely [63]. Let P be a set of n points and 
T a set of m unit circles in the plane. Let /(P, ) denote the number of incidences 
between the elements of P and T. 

Define a topological graph G on the vertex set P, as follows. Let any pair of 
points p1,p2 € P that are consecutive along a circle y € I’ be connected by the arc 
of y that passes through no other points in P. We delete those arcs that run along 
circles incident to at most two points. 

We have |V(G)| = nand |E(G)| > I(P, C)—2m. Notice that G is a “multigraph,” 
in the sense that the same edge can occur twice, along two arcs belonging to two 
different circles y;, y2 € I’. Keep only one of these arcs. This will reduce the number 
of edges by a factor of at most 2. With a slight abuse of notation, the resulting 
topological graph is still denoted by G. Applying the Crossing Lemma, we obtain 
for the number of crossing pairs of edges 


1 (U(P, C) — 2m)/2)° 


or(G) 2 64 n2 


provided that |E(G)| => 4n. On the other hand, we clearly have 
cr(G) < m(m — 1), 
as the total number of intersection points between m circles, with multiplicities, 


cannot exceed 2(5) = m(m — 1). Comparing the last two inequalities, the theorem 
follows. Oo 


Incidences Between Points and Lines Székely’s argument, with some simplifi- 
cations, can also be used to establish the Szemerédi-Trotter theorem mentioned 
above: There is an absolute constant c > 0 such that the number of incidences 
between n points and m lines in the plane is at most c(n?/3m?/? + n + m). The order 
of magnitude of this bound is best possible. In [65], this result was stated in the 
following (seemingly weaker, but equivalent) form. 


466 E. Szemerédi 


Theorem ((65]). Let t and n be positive integers, t < ./n. Let P be a set of n points, 
I’ aset of lines in the plane. Then the number of distinct lines in I’ that pass through 
at least t points of P is at most cn*/t°. 


Sum-Product Estimates Given a set of t non-zero reals A, let A+A and A-A denote 
the set of all pairwise sums {a + b: a, b € A} and pairwise products {ab : a,b € A} 
formed by the elements of A. If the elements of A form an arithmetic progression or 
a geometric progression, then |A + A| < 2t— 1 and |A-A| < 2t— 1, respectively. 
In a joint paper with Erdés [25], it was shown that it cannot occur that both of these 
sets have O(t) elements. More precisely, we proved the existence of a small ¢ > 0 
such that max{|A + A], |A-A|} > ¢'*°, 

Gyorgy Elekes, whose ingenious ideas contributed a lot to the near-solution of 
Erd6és’s problem on distinct distances by Guth and Katz, surprised us by a very 
elegant improvement on our theorem, based on the above upper bound on the 
number of lines that pass through at least t elements of an n-element point set in 
the plane. 


Theorem ([(18]). There is a constant c > 0 such that for every set of t reals, we 
have 


max{|A + A, |A- A]} > ct?/4. 


Proof. Let a;,...,a,; denote the elements of A. For every j and k with | <j,k < t, 
define 


Fix) = aj(x — ag). 


Notice that for a fixed k, the function f;, maps the elements a, + a; into ajaj, 
i= 1,...,¢. Thus, setting P := (A + A) x A-A, we find that the graph of each 
function fix is a line that passes through at least f points of P. We have f° such lines. 
Applying the previous theorem, we obtain that 


2< 


Pl 
ae a 


which implies that |P| = |A + A|-|A-A| > P/2/,/c, as required. Oo 


Elekes’s bound has been slightly improved by Solymosi [56], who established 
the inequality 


14/11 


t 


It is conjectured that the lower bound can be replaced by t?~*, for every ¢ > 0, 
but currently such a result does not seem to be within reach. Today sum-product 
problems have a huge literature. In particular, there are many deep sum-product 
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estimates for finite and other infinite fields, found by Bourgain et al. [7], Bourgain 
et al. [8], Konyagin and Rudnev [41], and others. The subject has become an 
important separate theme within additive combinatorics [66]. 


3 A Forbidden Submatrix Argument 


Erdés’s trick of reducing a geometric problem to a question in extremal graph or 
hypergraph theory has become a standard approach in combinatorial geometry. 
Unfortunately, it does not always work. Sometimes it might work, but the com- 
binatorial essence of the structure is hidden and cannot be easily extracted. Most of 
the above mentioned proofs of the bound u(n) = O(n*/?) for the maximum number 
of times the unit distance can occur among n points in the plane, use a forbidden 
subgraph argument, but it has to be applied to several carefully selected subgraphs of 
the unit distance graph. Applying them directly to the full graph, we obtain Erdés’s 
initial bound u(n) = O(n*/?). 

The only proof we know that follows by a direct application of a forbidden 
substructure theorem is due to Pach and Tardos [52]. To state their combinatorial 
result, we need some definitions. 

Given a graph G on n vertices v1,..., Un, let A(G) denote its n x n adjacency 
matrix, in which we put a | at the position (i,j) if and only if v;v, is an edge of G. 
Otherwise, the entry is 0. Conversely, for any symmetric 0-1 matrix A with an all- 
zero diagonal, let G(A) denote the graph whose adjacency matrix is A. A sequence 
C = (Po.P1.---+P2x) Of positions in a zero-one matrix A forms an orthogonal 
cycle if pp = p2, and the positions p2; and p2;+; belong to the same row, while 
the positions p2;+; and p2;+2 belong to the same column, for every 0 < i < k. If the 
entry of A in position p; is | for all 0 < i < 2k, then we call C an orthogonal cycle of 
A. For any symmetric 0-1 matrix A, every cycle of G(A) with a fixed starting point 
and orientation corresponds to an orthogonal cycle of A. 

Given a position p = (i,j) of the matrix A and an orthogonal cycle 
C = (po0,P1,--+-+P2x), let C(i,j) be the number of times that the possibly self- 
intersecting polygon pop; ... 2, encircles the point p’ = (i+ 1/2,j + 1/2) of the 
plane in the counter-clockwise direction. Here the position (i,j) in the matrix is 
represented by the point (i,j) of the plane. (This convention contradicts the tradition 
of writing the first row of a matrix on top!) More precisely, let P(i,j) be the set of 
positions (i, j’) with i’ > i andj’ > j, and set 


Cj) = {0 <1 Sk: pa € PUA) — HO <1 Sk: pai € PGS}. 


An orthogonal cycle is said to be positive if C(i,j) => 0 for every pair (i,j) and C(i,j) 
is strictly positive for at least one such pair. 

Pach and Tardos proved the following result, from which the bound 
u(n) = O(n*/3) for the number of unit distances can be deduced relatively easily. 
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Theorem ((52]). The maximum number of | entries in ann x n 0-1 matrix that 
contains no positive orthogonal cycle is O(n*/*). The order of magnitude of this 
bound cannot be improved. 


Since the last theorem is tight, this approach cannot lead to any improvement of 
the u(n) = O(n*/?) bound on the number of unit distance pairs in a set of n points 
in the plane. 


Is the O(n*/) Bound on the Number of Unit Distances Tight? It is rather 
remarkable that none of the known proofs for u(n) = O(n*/?) offers any hope 
to break this barrier, although it is conjectured that u(n) = O(n'*®), for every 
€ > 0. How is this possible? All existing proofs easily generalize to the case where 
the plane is equipped with some other strictly convex metric, i.e., with a metric 
according to which the unit disk is a centrally symmetric strictly convex region. 
However, using an idea of Brass [9], Valtr [69] found the following simple metric, 
with respect to which the number of unit distance pairs among n points in the plane 
can be as large as cn*/3, for some c > 0. 

Let the unit circle of this metric be the locus of all points (x, y) satisfying the 
equation |y| = 1—x*. This is a closed curve y, centrally symmetric about the origin, 
and it consists of two parabolic arcs. Assume for simplicity that n is of the form 
(2k + 1)(2k? + 1) for some positive integer k, so that k ~ (n/4)!/3. Consider the 
n-element point set 


P, = (7-4) li <kul<e) ct, 41P. 
Notice that the unit circle y passes through 4k points of P,. No matter how we 
translate y by a vector belonging to P,,, it will pass through at least k points of P,,. 
In other words, with respect to this metric, every point of P, is at unit distance 
from at least k others. Therefore, the number of unit distance pairs is at least 
lnk = nt, 
2 321/3 

Algebraically, this metric is quite similar to the Euclidean one! Which special 
properties of the Euclidean metric do we have to explore in order to break the 
bound u(n) = O(n*/?)? Perhaps there is no such property, and the bound O(n*/*) 
is optimal. Frankly, we do not know too many interesting examples of planar 
point sets, within which there is an exceptionally popular distance. There is no 
strong evidence supporting the assumption that such an example cannot exist. 
For convenience, we conjecture that the best constructions are latticelike. 


In Almost All Metrics, There Are Few Unit Distances Consider the plane 
equipped with an arbitrary metric with a strictly convex unit circle y centered at 
the origin. Let Po be the 1-element set whose only point is the origin. Pick a random 
element q; € y, and set P; := Py U (Po + q)). If the sets Po,..., Pr; have already 
been defined, then choose randomly a point g, € y, and let Py := Px—, U(Pr_-1 +. 4x). 
After k steps, we obtain a set P, of n := 2" points in the plane such that from 
every point there are precisely k = log,n others at unit distance. Thus, the 
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number of unit distance pairs in P;,, with respect to our metric, is at least in logn. 
In fact, almost surely, exactly this many unit distances will occur in P;,, because 
there will be no other “accidental” incidences between the points in P, and the 
unit circles centered at its elements. Moreover, MatouSek [47] proved that in the 
Baire category sense, with respect to almost all metrics, every set of n points in 
the plane determines only O(n lognloglogn) unit distances. We may take it as 
an indication that Erdés’s conjecture that in the Euclidean plane the number unit 
distances between n points cannot exceed O(n!+¢/!2!°8"), However, this evidence 
should be taken with a grain of salt. First, nlognloglogn is much smaller than 
n'te/leslos” Second, Baire category provides a rather counter-intuitive measure on 
the class of centrally symmetric convex curves y. For many curious properties of 
curves, it is hard to exhibit any explicit example that has that property, yet it can be 
shown that almost all curves in the Baire category sense possess it. The situation is 
analogous to the case of random graphs, where for instance we can show that almost 
all graphs of n vertices have clique number and independence number O(log n), yet 
it looks hopelessly difficult to come up with any explicit construction. 


Unit Distances on the Sphere In classical geometry, most problems are considered 
not only in the Euclidean space, but also in spherical and hyperbolic geometry. For 
our problem, the case of the sphere is particularly interesting. All known planar 
proofs easily extend to the sphere, so we have that the number of times that the 
same (angular or Euclidean) distance can occur among n points on the sphere S? of 
radius | is O(n*/3). In the plane, a ./nx ./n piece of the integer lattice is conjectured 
to provide the maximum number of unit distance pairs, but on the sphere there is no 
grid with two independent translational (rotational) symmetries. 

Leo Moser conjectured that on the unit sphere the same distance 6 among n 
points can occur only O(n) times. However, he overlooked a detail: the answer may 
depend on 8. As Erdés et al. [22] discovered, for 6 = 1/2, there may be cn*/? point 
pairs at angular distance 64, that is, the above bound is tight. To see this, suppose that 
n is even, and take n/2 points, p1,...,Pn/2, and n/2 lines 1;,...,J,/2 in the plane 
z = —1 in R? with at least cn*/3 incidences between them, for some c > 0. For each 
lj, let vj denote normal vector of the plane spanned by /; and the origin 0 = (0, 0, 0). 
Fori = 1,...,n/2, let g; and r; denote the intersection points of the line 0p; and the 
supporting line of v; with the unit sphere, respectively. Notice that whenever a line 

—> 


J; passes through a point p;, the vectors v; and 0p; are perpendicular and, hence, the 
angular distance between q; and r; € S? is equal to 2/2. Thus, the number of point 
pairs in 


{Oto 065 Yn/2s Vise ++ snr} Cc S? 


at angular distance 1/2 from each other is at least cn*/?, the number of incidences 
between the lines J; and points vj. 

On the other hand, Erdés et al. [22] disproved Moser’s conjecture for any angle 6 
different from 2/2. They constructed n-element point sets in S? with a superlinear 
number of 5-distance pairs, as n —> oo. Their nlog*n lower bound, where 
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log* stands for the iterated logarithm function, was improved by Swanepoel and 
Valtr [62] to cn./log n, for a suitable positive constant c. The gap between the lower 
and upper bounds is still huge! Strangely, on the sphere, even the trivial 5n log n 
lower bound construction described at the beginning of the previous subsection 
breaks down. 


4 Unit Distances with Special Angles 


Apart from his conjecture that the unit distance among n points cannot occur more 
than n!+¢/leslee” times, Erdés also made the stronger conjecture that, with the 
possible exception of a few points, all extremal examples can be obtained as subsets 
of a suitable 2-dimensional lattice. It is needless to say that we are very far from 
being able to verify or disprove this conjecture. Nevertheless, serious computational 
and other attempts were made to decide whether such a statement may be true at 
least for small values of n or in some restricted situations. 

Brass [11] managed to prove Erdés’s stronger conjecture in the special case when 
we count only unit distance pairs parallel to one of k fixed directions in the plane. 
To state his result more precisely, we need some notation. For any point set P in the 
plane and for any set U of k unit vectors, no two of which are opposite each other, let 


f(P,U) = l{p.qgePxP:p—qeU}}. 


Let f(n, k) be defined as the maximum of f(P, U) over all such sets P of n points and 
all sets U of k unit vectors with the above property. Obviously, we have f(n, 1) = 
n—1. The unit distance graph of a point set restricted to two directions is the disjoint 
union of subgraphs of unit distance graphs of square lattices. Therefore, it follows 
from a theorem of Harary and Harborth [31] that 


f(n,2) = [2n— 2a]. 


For any two positive functions, g and h, we write that g(n) = O(h(n)) if cih(n) < 
g(n) < c2h(n) for suitable constants cy, cz > 0. 


Theorem ([{11]). For any fixed k > 3, we have f(n,k) = kn— © (Jn). 

Furthermore, there are a finite number of lattices A\(k), Ao(k),... and an 
integer no(k) such that for every n > no(k) the extremal configurations Px, Ux are 
subsets of one of the lattices A;(k). Moreover, at least one of the extremal n-element 
point sets P,, with the exception of ./n points, can be obtained by intersecting Aj(k) 
with a convex Set. 


zoo 


If the number mo(k) were small, this would prove Erdés’s stronger conjecture, 
because every extremal set for the unit distance problem with no restriction on 
the number of directions is also an extremal set for some number of directions k. 
Unfortunately, this is not the case. 
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The direction (or angle) of a straight line in the plane is said to be rational if 
the angle between the line and the x-axis is a rational multiple of z. The special 
case of Erd6és’s unit distance problem where we count unit distances in all rational 
directions, was studied by Schwartz et al. [55]. In this case, they showed that Erdés’s 
weak conjecture is not far from being optimal: For any ¢ > 0, there exists no(é) 
such that the number of unit distance pairs with rational angles in a set of n = no(e) 
points in the plane is at most n\*°. 

Unit vectors in rational directions correspond to roots of unity. The proof 
proceeds by counting certain paths in the unit distance graph and using a theorem 
of Mann [46] to bound the number of edges. 

Using the Subspace Theorem in place of Mann’s Theorem, Ryan Schwartz 
considered unit distances from a multiplicative group with rank not too large with 
respect to the number of points [54]. As before, a unit distance in the plane will 
be considered as a complex number of unit length. So all unit distances can be 
considered as coming from a subgroup of C*. Schwartz established the following 
generalization of the Schwartz-Solymosi-de Zeeuw theorem stated above. 


Theorem ((54]). For any € > 0, there exist a positive integer no = no(e) and a 
constant c = c(e€) > 0 such that given n > no points in the plane, the number 
of unit distances coming from a subgroup I. C C* with rank r < clogn is at 
most n'*®, 


Proof. Suppose G = G(V, E) is a graph on v(G) = n vertices and e(G) = m edges. 
We denote the minimum degree in G by 6(G). 

Note that, by removing vertices with degree less than m/(2n), we obtain a 
subgraph H with at least e(H) > m/2 edges and 6(H) > m/(2n). The number 
of vertices in H is at least v(H) > ./m. We will consider such a well behaved 
subgraph instead of the original graph. 

Let G be the unit distance graph on n points with unit distances coming from 
T as edges. We show that there are fewer than n't such edges, i.e., distances, for 
any ¢ > 0. We can assume that e(G) > (1/2)n'**, v(G) > n'/*+*/? and 6(G) > 
(1/2)n*. 

Consider a path in G on k edges Py = popi... px. We denote by u;(Px) the unit 
vector between p; and p;+1. The path is nondegenerate if )°,-,ui(Px) = 0 has no 
solutions where / is a nonempty subset of {0,1,..., — 1}. Note that such a sum is 
a sum of elements of I’ with no vanishing subsums. We will denote by ?;(v, w) the 
set of nondegenerate paths of length k between vertices v and w. 

The number of nondegenerate paths of length k from any vertex is at least 


k-1 ke 


e nN 
Lee +) 2 or 


The first expression is true since if we consider a path Py on £ < k edges then all 
but 2° — | possible continuations give a path P¢+, with no vanishing subsums. The 
inequality is true if we assume 2* < (1/2)n°, which is true if k < (elogn)/log2—1. 
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From this, we get that the number of nondegenerate paths P; in the graph is at least 
nl/2+(k+1/2)¢ /92k+1_ So there exist vertices v, w in G with 

nktl/2e-3/2 

[Pe(v,w)| = — 


Consider a path Py € Px(v,w), Pk = popi--.px. Let a be the complex number 
giving the vector between po and px. Since P; is nondegenerate we get a solution 
of (1/a)x; + (1/a)x2 + --- + (1/a)x, = 1 with no vanishing subsums. Thus, by a 
corollary of the Subspace Theorem, due to Amoroso and Vieta [2], we obtain 


[Pe(v, w)| < (Ber TD, 
This, with the lower bound, gives 
((k + 1/2)e — 3/2) logn < klog 4 + 4k*(k + kr + 1) log(8k) 


< Crk logk, 


/ 1-4 
- crk* logk 


cl 
+—. 2 
logn k @) 


Since r + 1 < clogn, we can choose k an integer satisfying 
C'((logn)/r)'° < k < C"((logn)/r)'”. 
Then, with this k, the right hand side of (2) goes to zero as n increases. Earlier 
we assumed that k < (elogn)/ log 2 — 1. This holds for the value of k given above 


for n large enough. So the number of unit distances from T’ is less than cn'** for 
each ¢ > 0. O 


noo 


Performing a careful analysis of Erd6s’ lower bound construction, it is possible to 
verify that all unit distances come from a group with rank at most clogn/ log logn 
for some c > 0. This group is generated by considering solutions of the equation 
x+y = p where p is a prime of the form 4m-+ 1. Using the prime number theorem 
for arithmetic progressions, we get the bound on such solutions and thus on the rank. 
For the details, see [54]. So Erdés’ construction satisfies the conditions of the above 
theorem. A similar approach could be used for other types of lattices. So all the 
best known lower bounds for the unit distance problem have unit distances coming 
from a well structured group. It would be interesting to see if every configuration of 
points with the maximum possible number of unit distances has such a structure. 
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5 Variations of the Unit Distance Problem 


In this short survey, we cannot cover all aspects of the unit distance problems. 
We only mention those questions that we find most interesting. Our list of references 
is also somewhat uneven. It reflects our knowledge, our ignorance, and our taste. 
We refer the interested reader to the monograph Research Problems in Discrete 
Geometry by Brass et al. [12]. 

However, there are two variants of the problem that we cannot completely ignore 
even in such a short an subjective review. 


Point Sets in Convex Position We say that n points in the plane are in convex 
position if they form the vertex set of a convex polygon. Erdés and Moser [23] 
conjectured that the number of unit distances, Ucony(1), among n points in convex 
position in the plane satisfies ucony(1) = 3n+ O(1). They were wrong: Edelsbrunner 
and Hajnal [17] exhibited an example with 2n—7 unit distance pairs, for every n > 7. 
It is widely believed that ucony(1) = O(n), and perhaps even Ucony(n) = 2n + O(1). 
The best known upper bound is due to Ftiredi [27], who proved by a forbidden 
submatrix argument that ucony(1) = O(n logn). A very short and elegant inductional 
argument for the same bound can be found in [13]. 

Erdés suggested a beautiful approach to prove that ucony(m) grows at most linearly 
with n. He conjectured that every convex n-gon in the plane has a vertex from which 
there are no k+ 1 other vertices at the same distance. Originally, he believed that this 
is also true with k = 2, but Danzer constructed a series of counterexamples. Later, 
Fishburn and Reeds [26] even found convex polygons whose unit distance graphs 
are 3-regular, that is, in which for each vertex there are precisely three others at unit 
distance. If Erdés’s latter conjecture is true for some integer k, then this immediately 
implies by induction that ucony(n) < kn. 


Unit Distances in Higher Dimension Erd6s’s unit distance problem can be asked 
in any metric space. Let ug(n) denote the maximum number of unit distance pairs 
that can occur among n points in R¢. In their landmark paper [16] in which they 
found alternative proofs and generalizations of the Szemerédi-Trotter theorem on 
incidences between points and lines, Clarkson, Edelsbrunner, Guibas, Sharir, and 
Welzl proved that u3(n) < n°/?a(n), where a(n) is an extremely slowly growing 
function, closely related to the inverse of Ackermann’s function. During the last 
quarter of a century, no improvement was made on this bound, apart from the fact 
that now we know that the term a(n) can be eliminated; see Zahl [71] and [37], 
for another proof. By a simple number theoretic argument, Erdés [21] demonstrated 
that in an n!/3 x n!/3 x n!/3 cube lattice, the same (unit) distance occurs at least 
cn*/3 log logn times, for a constant c > 0. This is the best presently known lower 
bound, and its order of magnitude is conjectured to be nearly optimal. 

The innocent reader may suspect that if it is so difficult to determine the 
asymptotic behavior of the functions ug(n) already for d = 2 and 3, then the task is 
even harder for d > 3. However, this is not the case, as is shown by the following 
so-called Lenz configurations. For d > 3, take |d/2| mutually orthogonal circles of 
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radius 1/,/2 in R4, centered at the origin. Place n points on these circles, distributed 
among them as evenly as possible. Notice that the distance between any two points 
lying on different circles is precisely 1. Thus, the number of unit distances in this 
example is at least 5 (1 - wa + o(1)) nr. 

On the other hand, Erdés [21] proved that the unit distance graph of n points in 
each of its classes. Using the Erdés-Stone theorem [24], a ‘cornerstone in extremal 
graph theory, this condition is sufficient to deduce that for d > 3 we have 


ug(n) = : 


1 
(a 
This is one of the most beautiful early examples of the interplay between discrete 
geometry and extremal graph theory, from which both fields have richly benefitted. 

Swanepoel [61] actually proved that, for every d > 3 there exists no(d) such that 
all n-element extremal point sets for the d-dimensional unit distance problem are 
Lenz configurations, provided that n > no(d). Moreover, for even dimensions d > 6, 
he also succeeded in determining the exact value of ug(n) for all sufficiently large n. 
For d = 4, completing the work of Brass [10], van Wamelen [70] determined the 
precise value of u4(n) for sufficiently large n. 


+ o(t)) n. 
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Goldbach’s Conjectures: A Historical 
Perspective 


Robert C. Vaughan 


Abstract In 1742, Goldbach and Euler in conversation and in an exchange of letters 
discussed the representation of numbers as sums of at most three primes. Although 
the question as to whether every even number is the sum of one or two primes (the 
binary Goldbach conjecture) is still unresolved, this and associated questions have 
attracted many mathematicians over the years, and have lead to a range of powerful 
techniques with many applications. This article is a commentary on the historical 
developments, the underlying key ideas and their widespread influence on a variety 
of central questions. 


1 Introduction 


Christian Goldbach was a German mathematician who was a professor of mathe- 
matics and history in St Petersburg. On 7 June 1742, from Moscow, he wrote a letter 
to Leonhard Euler (letter XL) in which he proposed the following conjecture: 


Every integer which can be written as the sum of two primes, can also be written as the sum 
of as many primes as one wishes, until all terms are units. 


In the margin he than added a second conjecture. 


Every integer greater than 2 can be written as the sum of three primes. 


Of course he took | to be prime. One can also observe that since any representa- 
tion of an even number would have to include a 2 it would follow that 


Every even number is the sum of two primes. 


In a letter dated 30 June 1742 Euler reminded Goldbach of an earlier conversation 
they had in which Goldbach had pointed out that his original conjecture would 
follow from this last statement. 

Of course today we would state the Goldbach binary and ternary conjectures as 
follows. 
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Every even integer greater than 2 can be written as the sum of two primes. 
Every odd integer greater than 5 can be written as the sum of three primes. 
[Include facsimile of letter. ] 


The above is well known, of course. What is perhaps somewhat less well known 
is the following assertion of Descartes. 


Quod tamen nondum demonftraui. Sed & omnis numerus par fit ex vno vel duobus vel tribis 
primis. 
Descartes[ 1596-1650], Opuscula Posthuma 
It is not yet proved, but all numbers are made from one or two or three primes. 
Descartes[ 1596-1650], posthumous small works 


2 Sylvester 


There is nothing of consequence in the literature until 1871. Spottiswoode, then 
President of the London Mathematical Society, in his account [82] of communica- 
tions received during the meeting of 9th November 1871 describes at some length 
researches that Sylvester had been undertaking on the behaviour of 


R(n) = card{p,, p2 : py + p2 = n} (1) 


when n is even. In modern notation Sylvester asserts that probabilistic arguments 
suggest that 


p-—2 
S(n) = x(n) [] — (2) 
p-1 
psJn 
ptn 
should be a good approximation to R(n), and further asserts that this is confirmed 
by the known calculations. The product here is quite interesting. A simple argument 
based on the observation that a number x is divisible by p with probability ; leads 
instead to the expression 


p-2 p-l 
; ps aie 
I] I] (3) 
psJ/n P pin P 


ptn 


It turns out that (2) is bad and (3) is worse. However, as we shall see, it is interesting 
and curious that Sylvester should find the product 
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Of course, the prime number theorem was 20 odd years in the future, and it would 
be another 3 years before Mertens would prove that 


2 ~ log log x 


psx 


and 


Tl p-1l e? 
eae logx 


where y is Euler’s constant. The first of these expressions illustrates the underlying 
difficulty with probabilistic methods in that the series 


1 
2a, 


diverges. Armed with the above and the prime number theorem it is not hard to show 
that for even n 


s(n) ~ 26-72) PP} 


where C is the, so called, twin prime constant 
cI tap) 
p>2 
and to deduce as Landau [55] did in reference to similar work of Stackel [83] that 
2 
Y_R(n) ~ ee 
n<x 
whereas 


4g) 
Y> Sin) ~ 2e esa: 


nsx 


We have 


2e-” = 1.1229..., 
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and we now believe that for even n 


' p=1 
R(n) ~ Clio(n) | | so (4) 
pin 
p>2 
where 
; n—2 dx 
lig(n) = / — (5) 
»  (logx) log(n — x) 
Approximately 
linn) =—" + (6) 
1 = eae 
wv Cogn)?” Cogn) 
whereas the factor 2(n)/ log n satisfies 
m(n n n 
(n) _ (7) 


logn — (logn)? a (log n)3 aie 


These discrepancies will somewhat cancel each other for smaller n and perhaps 
account for Sylvester’s belief that the known calculations supported his formula. 


3 Hardy and Ramanujan 


The first two substantial lines of progress on Goldbach’s conjectures arose almost 
simultaneously. One was a consequence of a paper which nowhere mentions 
Goldbach, namely the seminal paper of Hardy and Ramanujan [38] which, as is 
widely known, is mostly concerned with a formula for the partition function. The 
fundamental idea is that there is some arithmetical function R(n) of interest and the 
generating function 


f@ = ORM?" 
n=0 


converges in the unit disc but has singularities on the circle centred at 0 of radius 1, 
and perhaps even has that circle as a natural boundary. Then by the Cauchy integral 
formula 


—_ 1 —n—-1 
R(n) = aq [ f@ dz 
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where @ is a circle of radius r, 0 < r < 1 about 0 and described in the positive 
sense. It is also supposed that r is close to 1. How close will depend on n and the 
behaviour of f(z) on @ as r approaches 1. Typically for functions of arithmetical 
interest the behaviour of f(z) can be ascertained quite precisely if 


z=re(a/q+B) ea) =e" 


with r “close” to 1 but 6 “small” and gq is “not too large”. Let me illustrate this 
with an example which relates to Waring’s problem. You will recall that Waring had 
asserted that every positive integer is the sum of four squares, nine cubes, nineteen 
biquadrates, “and so on’. Suppose that R(n) is the number of ways of writing n as 
the sum of s non-negative cubes. Then it is readily seen that 


f@) = YLRMz" = ge) 


n=0 


where 


a) = Doz". 
=0 


Let z be as above and let us sort the terms according to their residue class modulo gq. 
Then 


q 


g(2) = Do e(am*/q) S° rel). 


m=1 n=m mod q 


When r is close to | and f is small we can approximate the sum over n by an integral 
and after several changes of variables we get 


ez) ~q 'S(q,a( + 1/K)C — re(B))*. 


Here S(q, a) is the generalized Gauss sum 


q 


S(q,a) = Y> e(am/q) 


m=1 


and was shown by Hardy and Littlewood [36] to satisfy 
S(q.a) = O(q'"") 


provided that (g,a) = 1. When k = 2 this estimate can be made sufficiently precise 
that it is of utility on the whole of @, and this is described briefly by Hardy and 
Ramanujan. For s > 5 (and when s = 4 via the Kloosterman refinement) it leads 
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to an approximation to R(n), the number of representations of n as the sum of s 
squares, of the form 


(3/2) 


ii 
rea” 


R(n) ~ 


where 


oo q 
6(n) = )oq° SY) Sg,a)'e(-an/q). 

q=1 a=1 
(a.q)=1 
Modern technology (see Vaughan [86]) would enable this to be carried through in a 
routine way fork = 3 ands > 13 also. When k > 3, and at the time when k = 3, the 
approximation for g(z) obtained was only good for a rather small subset of @ and 
another substantial idea was required. This difficulty Hardy and Littlewood were 
able to overcome using ideas from Weyl’s seminal paper on uniform distribution. 
However pursuing this would take me too far from our path. 


4 Hardy and Littlewood 


When Littlewood returned from active duty in the First World war, he and Hardy 
set themselves the task of developing the fundamental ideas of the Hardy and 
Ramanujan paper. This eventually saw the light of day as the series of eight papers 
with the generic title “On some problems of partitio numerorum”, the first two of 
which are on Waring’s problem, and the seventh of which never appeared in print. 

I understand that the first questions they examined were in fact the Goldbach 
conjectures. Eventually they realised that they could only make progress by 
assuming the generalised Riemann hypothesism and they preferred that the first 
papers using what has come known as the Hardy—Littlewood method established 
unconditional theorems. However HI [33] and V [35] are concerned with the 
Goldbach conjectures. 

To describe these results I am going to make an innovation that only came later, 
due to Vinogradov. If we were to follow Hardy, Littlewood and Ramanujan the 
generating function for sums of s primes would be 


f@ =s8@)', 8) = > Mogp)2. 
p 


However if we are only interested in the representation of nm we can suppose that the 
primes p satisfy p < n. But now since the sum is finite we can work on the unit 
circle. Thus we will work with 


F(a) = ) “\(logp)e(ap) 


psn 
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and now we find that if 


r(n)= > (logpi)... ogps) (8) 


then the substitution z = e(@) in Cauchy’s formula gives 


1 
r;(n) =| F(a)*e(—an)da. 


We can also view this as a consequence of the fundamental orthogonality relation- 
ship of harmonic analysis. 

If we apply the recipe used previously of sorting the terms of F according to 
the residue classes in which the prime lie modulo qg, and we apply results that 
follow from the generalised Riemann hypothesis, then we find that, provided that 
also (q,a) = 1, 


F(a/q + B) = ae S| e(Bm) + o((a+ gn|B|)?n?**) 


m=2 


Here jz is the Mobius function and ¢ is Euler’s function. There is a famous theorem 
of Dirichlet on diophantine approximation that tells us, given areal parameter Q > 1 
and any real number a, there are g and a with (q,a) = 1 such that |w — a/q| < 
gq 'Q~'. Now if we take B = a — a/q we see that the error term above is always 


0 ((+n/0)?n***) 
and if we make the optimal choice Q = n? this is 
O(ni**). 


Thus Hardy and Littlewood were able to treat r,(m) whenever s > 3 and they 
established, on the generalised Riemann hypothesis, that, when n = s (mod 2), 
ns! 
rs(n) ~ GD! = pies) 


where 


cpt! (1) 


pin 
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When s = 2 they were unable to prove anything quite as precise but they could 
establish that 


n 


Y- (r2(m) — m&x(m))” = O (n?+*) (9) 


m=1 
2tm 


This is the strongest evidence we have that for even n, 
r2(n) ~ nGa(n). 


Of course, why stop there. How good an error term should one expect? Well, one 
could speculate that 


r2(n) = nG2(n) + O(n?**) 


and if true, then this would be essentially best possible as Montgomery and Vaughan 
[68] have proved unconditionally that 


n 


) (r2(m) — m©>(m))" = 2 (n° (log n)’) : 
m=1 
2tm 


Returning to (9) it follows quite easily that the exceptional set 
E(x) = card{m : 2|m,m < x, ro(m) 4 0} (10) 
satisfies 
EQ) =0(x?**). (11) 


Of course, this assumes the generalised Riemann hypothesis. 


5 Vinogradov 


In [92] Vinogradov introduced a fundamental new method which enabled theorems 
of the following kind to be established. 


Theorem 1. Let 


F(a) = )\e(ap). 


psn 
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Suppose that q € N, a € Z, (q,a) = 1 and |a — a/q| < q-*. Then 


1 
F(a) =O (ndoem’ (q-? + (n/q)~?) + nexp ( = ; vie") ) : 
This gives non-trivial estimates when 
(logn)* <q <n(logn)™“, |a —a/q| < (logn)*n™'q" 


and complements what can be deduced from the more classical theory of Dirichlet 
L-functions. Thus Vinogradov was able to establish that if n is odd, then 


2 


n 
R3(n) ~ Dlogns 3): 


Of course the method also gives 


> (Ro(n) = li2(n)G2(n))” = O (n° log n)) 
m=1 


2|m 


for any fixed positive number A, and consequently 
E(x) = (x(logx)™“) , 


and this was established independently by Chudakov [14], van der Corput [15] and 
Estermann [21]. 
Later Montgomery and Vaughan [69] pushed these ideas further and obtained 


E(x) = O(x'*) 


for some positive number 6. A number of authors have computed values for 5. The 
first was Chen [13] who obtained 6 = mw More recently Lu [61] has obtained 
§ = 0.121 and Pintz has claimed 5 = 3. 

Let me briefly describe Vinogradov’s idea. In principle it works well for sums of 
the kind 


Sf) 


psx 


where the function f is oscillatory, so one expects some cancellation, but is not 
multiplicative. Thus e(ap) or x(p + c) are fine, but 7(p) is not. Actually this rule of 
thumb can be broken in applications when we average over a large class of 7. See 
Sect. 11 for an example. 
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Of course in many ways the primes are rather randomly distributed so such sums 
are hard to deal with. Vinogradov is able to apply the sieve of Eratosthenes in a 
rather non-standard way to relate the original sum to two types of sums which are 
more tractable. These are as follows. 


Type Isums. These are sums of the form 


Yia S> flim) 


l<zm<n/I 


where the a; are complex numbers which satisfy, at least for the sake of this 
exposition, 


|a;| = O(log” 21) 


and are “good” when z is, say, some power of n smaller than n. 
Type II sums. These are sums of the form 


Soar So buf (im) 


I>z z<m<n/1 


where the a; and b,, are complex numbers which satisfy 


> la)" = O(xlog® x), Ss |bn|” = O(xlog® x). 


I<x m<x 


These can be considered “good” if z is not too small by comparison with n. 


Why have I used the terms “good”? Well, consider the Type I sum 


> ay » e(alm). 


l<z m<n/1 


Here the inner sum is just the sum of the terms of a geometric progression, 
so provided that wm ¢ Z the inner sum is bounded by 


1 


| sin zal| 
and so the double sum is bounded by 
n 1 
log x)© min | —, ————— 
(log.x) dX (; =) 


and this is something which is quite nice to deal will as long as z is somewhat smaller 
than n. 
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How about the Type II sums? In the original treatment Vinogradov makes mul- 
tiple applications of the Cauchy—Schwarz inequality which look a bit mysterious. 
Here is what is really happening. We can suppose the support of both the a; and b,, 
lies in the interval (z, x/z) and we can write the double sum as 


ab" 


where .@ is (essentially) an (x/z— z) x (x/z— z) matrix with general entries f(/m). 
Applying Cauchy-Schwarz once replaces this by 


y (a.a*)bWM*b*. 


Now 


MM” = (Sanz) 
I 


km 


is a Hermitian matrix and it turns out we just need a bound for the largest eigenvalue. 
One such bound is 


max ) 
k 


m 


Yo f(kD ifm 
1 


In the case f(x) = e(ax) this is also quite nice to deal with. 

Note that I have oversimplified this explanation somewhat, but in principle it 
is the way one proceeds. In particular there is an important technicality I did not 
mention in order to avoid obscuring the underlying ideas. When one applies the 
Cauchy—Schwarz inequality, in order to get the best out of it is a good idea to ensure 
that the order of magnitude of the terms does not vary too much. Thus in the Type II 
sums it is usual to divide the sum into dyadic ranges, that is into subsums of the kind 


So a SY) buf (im). 


2r<l<2't+1 — z<m<n/1 


Another wrinkle that is useful if one is trying to squeeze as much as possible out of 
the method is to split some Type I sums into two ranges 


doa do fm + 2 Flim) 


l<u m<n/1 u<l<zm<n/1 


and treat the second sum here as a Type II sum. 

To obtain good Type I and Type IJ sums Vinogradov applies the sieve of 
Erathosthenes, but then has to make various combinatorial rearrangements, which 
for the most powerful results were not well understood. Here is a much simpler way 
of proceeding [84]. 
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Consider the trivial identity 


- £00) = Fs) + G6N(~ £9) 


= FOIGEEGE) + (KO) ~ FEW) (TZ -GO) AD 


where 


We can apply the identity theorem for Dirichlet series to give a partition of the von 
Mangoldt function 


A(m) = co(m) + c1(m) — co(m) + c3(m). 


Now multiplying by f(m) and summing over m we obtain 


Y> A(mf(m) = So + Si — Sp + $3. (13) 


m<n 


The sum 


So = Do AMF) 


k<u 


will be small for u small so we can treat that trivially. The sums 


Si = D7 HO DY) Cog myf (im) 


l<v m<n/l 
and 
S.= a, > flim) 

I<uv-— m<n/I 

with 
a= > Auk) 
j<u 
k<v 
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are good Type I sums, and 


83= Joa SY) u(mftin) 


Su v<m<n/l 
with 


c = logl— 9° A(k) 
k<u 


Kl 


is a good Type II sum. 
When applied to the generating function 


F(a) = ) (log p)e(ap) 


psn 


the method gives 


Theorem 2. Let 


F(a) = ) (log p)e(ap). 


psn 


Suppose that (q,a) = 1 and |a — a/q| < q~?. Then 
F(a) = O(n(logn)? (q7V? + 2-9 + (n/qy-")). 


This method has had many other applications. Vinogradov and Hua [49] have 
extended the method to sums of the kind 


Yi e(ap’) 


psx 


and consequently there is a large body of work on the Goldbach—Waring problem. 
That is, on the solubility of equations 


pipe bp =n 
in primes pj,...,Ds. 


There are also a number of rather different applications. One example, due to 
Vinagradov [93] himself concerns non-trivial bounds for sums 


> x(p +a) 


psx 
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where y is a non-trivial character modulo g. There are many papers on this and 
related topics. For a recent incarnation see Friedlander et al. [24] Another due to 


Piatetski—Shapiro [71] states that when | < c < x 


card{n < x: |n°|prime} ~ Toex 
clogx 


There are quite a number of papers increasing the upper bound for c. The best that I 


have seen is 


1< 228 = 148 
<c aps — 118. 


due to Rivat and Wu [77] 


6 The Goldbach Ternary Problem 


Now let me give a brief outline of the proof of Vinogradov’s three primes theorem. 
To ease expository detail we consider r3(n) given by (8), rather than 


R3(n) = card{p,, Pr, P3 : Pi + P2 + p3 = ny 


considered by Vinogradov. 


Theorem 3. Suppose that A is any fixed positive real number. Then 
1 
r3(n) = 5 San) +0 (n* (log n) “) 


where 


con (1(++ 29)) Man) 


ptn pin 


From this it follows that every large odd integer is the sum of three primes. Some 
of Vinogradov’s exposition of this are somewhat complicated by his desire to avoid 
results in which implicit constants are not computable, with the aim in mind of 
computing an no beyond which all odd n are the sum of three primes. Recently 
Helfgott has spoken about showing that every odd integer greater than 5 is the sum 
of three primes, and it has been written up [42], although I am not sure whether it has 
been accepted for publication. Helfgott refines the method by taking a generating 
function 
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F(a) = )~ w(p)e(a@p) 


psn 


with more complicated weights w(p) than log p. These have the effect of easing the 
size of some of the constants which occur in various estimates. 


Proof. We have 
= | F(a)’e(—an)d 
r3(n) [ (a)>e(—an)da 


where { is a unit interval, and we expect that r3(n) is roughly n? in size. A crude 
bound for the integrand is 


3 
(x ntoep)} ~n, 


pS 


Thus we have to “save” at least n. 
Let B be given and assume n is sufficiently large in terms of B. For convenience 
let L = (logn)? and consider the intervals (“major arcs”) 


M(g,a) = {a : ja —a/q| < Lg 'n™"} 
for 
l<a<q<L, (aqg=1. 
They are disjoint and contained in the unit interval 
U= (In!,14+ Ln"). 


Let Mt denote their union and define their complement with respect to LU, the “minor 
arcs”, by 


m = U\. 


We are hoping and expecting that the contribution from m is 0(n*), so we have 
to save more than n. For an a € m we use Dirichlet’s theorem on diophantine 
approximation to provide us with g,a so that 1 < a < gq < nL“, (qg,a) = 1 and 
la —a/q| < q_'Ln“!. Since a ¢ IN it follows that g > L. Applying Theorem 5.2 
we find that 


—_B 


sup |F(a)| « n(log n)? 2, 


aem 
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But this only “saves” a power of a logarithm. However we can use Parseval’s identity 


/ |F(a)|?da = Y “(logp)” ~nlogn 
Te 


psn 


which “saves” n(logn)~!. Thus combining the two 


] [F(a)|3do = O (n?(ogn)#* ) 
m 
and this is 

O (n° (log n) “) 


provided that B > 2A + 7. This argument clearly works with any exponent greater 
than 2 in place of the 3 but fails when it is 2 and this is one reason why binary 
problems are so much harder than ternary ones. 

By the way, although 


/ |F(a)|?da = Y “(log p)? ~nlogn 
u 


psn 


it is probably true that for any fixed k # 0 
/ |F(a)|?e(2ka)da ~ nG(2k) 
u 
as n —> oo. With m defined as here this is equivalent to 


/ |F(a)|>da ~ nlogn 
m 
but 


/ |F(a)|?e(2kar)da = o(n). 


m 


Having shown that the contribution from the minor arcs is small compared with 
the expected main term it remains to deal with the contribution from the major arcs. 
By invoking a standard theorem (the Siegel-Walfisz theorem, [70] Corollary 11.19) 
on the distribution of primes in arithmetic progressions it follows that whenever 
l<a<q<L,(a,q) = 1anda € M(q,a) we have 


n 


F(a) = a > e((a —a/q)m) +O (nexp (- cy/logn)) ; 


m=1 
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On each major arc we replace F(a) by the main term here. The total contribution 
from the error term to the integral over 0 is 


O(n’ (logn)~“*) 


The main term contributes 


H(q) 3 
5 5 3 e(—an/q) T(B)e(—Bn)dB 
ar eat OG )? ee 


(a,q)=1 


where 


T(B) = >> e(Bm). 


m=1 


The function T(f) is bounded by ||f||~! and so the interval [—L/(gn), L(gn)] can 
be replaced by [-3. 5] with an acceptable error term. Then the integral is simply 
the number of solutions of m; + m2 + m3 = n in positive integers m;, and so is 
$(n — 1)(n — 2). The sum over g can then be completed to infinity. Thus one finds 
that for B > Bo(A) 


(n—2)(n— 1) 
2 


/ F(a)%e(—an)da = 63(n) + O (n° (log n) “) 
m 


where 


7 Twin Primes and Prime k-Tuples 


We have already observed that the ideas I have described so far are quite effective 
for ternary problems, but much less so for binary problems. However they have been 
quite successful in creating conjectures. The procedure is quite simple. One assumes 
without proof that the minor arcs give a negligible contribution! In this way Hardy 
and Littlewood were able to write down a large class of plausible conjectures. One 
such is the following. Let 


Wax) = D2 A(n)A(n— 24). 


2k<n<x 
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Then 


Wak (x) ~ xG2(2k) 


as x — oo. Whilst this seems well out of reach, Lavrik [56] (van der Corput [15] 
had earlier shown something similar but with the sum over n replaced by a sum 
restricted to primes p for which p — 2k is also prime) has shown that 


S> (Waele) — (x = 2k)G2(2K))? K Blog. x)™. 
k<x/2 


We shall see later that this has its uses. 

Another of the Hardy and Littlewood conjectures concerns k-tuples of primes. 
Suppose thath = /,,...,h, and let v(qg) = v(q;h) be the number of distinct 
residue classes amongst the h. Let 


my(x;h) = card{n <x:n+hy,...,n+ hy all prime} 
and suppose that for every g € N we have v(q) < q. Then 


Xx 


r(x; h) ~ Cogn OH) 
where 
k kp! p* —p*v(@p) 
Sih) = [T]* I] 
pta (p— 1" plA =i) 
and 


A= |[[ |Ai-Ajl. 


l<i<j<k 


Of course, this is still well beyond our compass. However considerations closely 
connected with this conjecture underly the recent exciting work by Zhang, Maynard 
and Tao on bounded gaps in the primes, and I will say more about this later. 


8 Brun and Selberg 


There is another approach to the Goldbach and similar problems. This is the second 
of the two methods which arose almost simultaneously, namely the advent of a 
useful sieve method. Of course the idea of the sieve by Erathosthenes is over 2200 
years old, but it was not until Brun [11] that a concerted attempt was made to apply 
the ideas in a more sophisticated way. 
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Generally given a sequence </ = {a(m) : m € N} of non-negative real numbers 
we are interested in 


S(@H,z) = ) a(m) 
meN 
(m.P@))=1 


where 


P@) =[|p 


pS 
and we make some basic assumptions about divisibility and growth, such as 


Aa = oat) = 9 Ox + ry (14) 


meN 
d|m 


where p is a multiplicative function, X is some measure of A; and, we hope, Rg is 
somewhat smaller than X, at least on average over squarefree d. For example, if we 
are interested in the twin prime conjecture we might take 


1 m=l1(1+ 2) for some/ < X. 
a(m) = 
0 otherwise. 


Then p(d) is the number of solutions of the congruence 
(i+ 2) =0 (mod d) 


and it is readily seen that p is multiplicative, o(2) = 1 and p(p) = 2 for odd 
primes p. Moreover the above holds with 


|Ra| < p(d). 


The fundamental problem with this is that if we proceed in the obvious way by using 
the Mobius function in the form 


~~ H@) 


d|(m,P(z)) 


to pick out the condition (m, P(z)) = 1 and obtain an expression of the form 


Xx 2 pon 4 2 Ry 


d|P(z) d|P(z) 
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the sums over d will have far too many terms. Thus the primary object of sieve 
theory is to find functions AF such that 


dor S e®@ s 


dll d|l dll 


which give reasonable results but whose support is contained in, for example, 
[1, D] for some parameter D. Brun was the first to do this successfully by adapting 
Sylvester’s principle of inclusion—exclusion for sets to construct A7. The methods 
are very combinatorial and result in considerable complexities. However there was 
an immediate success for the method. The simplest of Brun’s upper bound methods 
leads to the inequality 


x = (eee) . 
pe (log x) 
p+2prime 


It follows that 


1 
YD p< 


P 
p+2prime 


and so dashes any hopes of proving that there are infinitely many twin primes by 
imitating Euler’s proof that there are infinitely many primes. 

Lower bound methods lead to less clear cut conclusions. Let Y, denote the 
subset of N consisting of those m > 1 with at most k prime factors. Brun was able 
to establish that given any large even n there are /, min Wp such that n = 1+ m. The 
ultimate theorem in this direction, after many technical developments, and including 
the advent of the large sieve and the Bombieri—Vinogradov theorem for which one 
can see Sects. 10 and 11 below, is the celebrated result of Jing-Run Chen [12] that 
every large even number can be written as the sum of a prime and an element of Y. 

There is a particularly effective upper bound method due to Selberg [81]. Selberg 
observes that if we suppose that the sequence {A(n)} has A(1) = 1 and its support 
is a subset of the divisors of P(z), then 


2 


S(L,2) < Yi a(m) | 9 A@) 


m d|m 


On multiplying out, interchanging the order of summation and inserting the 
assumption (14) we obtain 


De A(d)A(e) p({d, el) 


dq tT LAMAR: 


de 
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Now it is supposed that the second term here can be taken care of by an appropriate 
choice for the support of the A(d) and will then be suitably bounded. Thus the initial 
task is to minimise the quadratic form 


A(A)A(e)p([d. el) 
B [d, e] 


de 


subject only to the constraint A(1) = 1. It is useful at this stage to suppose further 
that the support F of the A(d) is divisor closed, ie if d € Y, then e € F whenever 
eld, and commonly this is done by taking 


Q ={d:d|P(2),d <D} 


where D is a parameter at our disposal. It turns out that this minimisation process is 
quite easy and the minimising choice gives 


x 


S(A,z) < GD.) 


+ 2 u@* HC) Rial 


d<D e<D 


where 


G(D, z) = dy PO)... 
(D, z) 4 OTN 6G) 
d|P(z) 


In the case of the Goldbach conjecture, given an even natural number n we could 
take a(m) to be the number of / with (n — 1) = mand 1 < / < n—1, so that 
a(m) = 0, 1 or 2. Then 


Ag _ —y t= 1) + Ra 
where p(d) is the number of solutions to the congruence /(n—/) = 0 (mod d). Thus 
p is multiplicative, p(p) = 1 when p|n and p(p) = 2 otherwise. It turns out that the 
choice z = D = x2 (log x)~# leads to 


ROS See o( 


n(log log 4) 
(log n)? 


(log n)? 


This misses the conjectured result by only a factor of 8, and it is remarkable that such 
a simple and elementary method will do so well. There are various wrinkles that can 
be applied to this. For example one can take a(m) to be 1 when m < n andn— mis 
prime and 0 otherwise. Then Selberg’s method combined with Bombieri’s theorem 
(see Sect. 11) enables one to replace the 8 by 4. Further small improvements can be 
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made by using more complicated weights a(m) which take account of information 
from lower bound sieve methods as well. The best that is know is a constant a little 
bit smaller than 3.5. 

The recent sensational work on small gaps between primes (see Sect. 15) shows 
that there is further information which can be teased out of these methods. 


9 Schnirelmann 


There is a third attack on the Goldbach problem, due to Schnirelmann [80] which 
uses just an upper bound sieve method, and then not directly. It appeared before 
Vinograd’s method, and so for a short while was the only unconditional game in 
town. It also had the merit that it stimulated a whole new area. 

Schnirelmann’s idea is to consider densities of sets of integers. Given subsets 7 
and & of Z it is natural to consider an new set @, 


C=AS+B, 


the set of integers of the form a+ b witha € & andb € Z. It is also handy to 
define iteratively for s € N, 


sA = A +(s—l)o 


and to take 


A(n) = 1 
1 


m= 


mea 


Now suppose we have some idea of density. Then we would hope that the density 
of @ is greater than the density of either </ or &. If we are being really optimistic, 
then we might hope that the density of @ is equal to the sum of the densities of / 
and &. Unfortunately this fails for any of the more natural definitions of density, 
such as lower asymptotic density 


A 
df = liminf AM) 
noo n 
For example, let Let </ and & both consist of the zero residue class modulo q. Then 
aA + & gives nothing new. 
Schnirelmann’s clever idea is to consider 
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the Schnirelmann density of /. The nice thing about Schnirelmann density is that if 
a(@#)=1 
then 
Ncw. 
Note that if </ contains 0 or any negative integers, then they are not counted by 


A(n). However 0 plays an important réle. Schnielmann conjectured the following, 
which was eventually proved by Mann [63]. 


Theorem 4. Suppose that0 € & and0 € &, then 

o(&% + Z) > min (i, a(@) + o(f)). 
Obviously the primes have density 0, and generally there is a parity problem when 
considering primes. However, consider the set .e% consisting of 0 and the n such that 
2n is prime or the sum of two primes, and suppose this has positive density, say 


o(Y%)>s 


for some positive real number 5. Then choose s so that si > 1. Then by repeated 
application of Mann’s theorem it follows that 


a(s#) = 1 
and so 
NcCs@#. 
Thus every even number is the sum of at most 2s primes. 
How might one show unconditionally, prior to Vinogradov, that ./ has positive 


density? The answer lay with the Brun sieve. By the prime number theorem 


2x? 


ae Yo < Yon) = Yo nm) + 2x(2x). 
( O8 x) P1,P2 n<2x m<x 
Pitp2S2x 


Thus 


(2+ o(1))x? 
“Gog? = 272m) 


max 
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and the Cauchy-Schwarz inequality gives 


(44 o(1))x4 


(logs)! < A(x) » r2(2m)?. 


msx 
Thus the upper bound sieve can be applied to r2(m) to obtain for m > 2 


(C + o(1))m 


r(2m) < toga? 


G2(2m) 


and s straightforward calculation then gives 
A(x) > 6x. 


for x > 1. In the early days the sieve gave a poor value for C, and only results 
somewhat weaker than Mann’s were available, so the lower bounds for s were 
rather large. Nevertheless the methodology is not without interest, especially since 
it provided a result for all even n. It has been refined by a succession of authors. The 
ultimate result is due to Ramaré, [74] who proved that every even number is the sum 
of at most six primes. 

Let me return to lower asymptotic density. As already pointed out there are 
problems with developing a theory comparable with that of Schnirelmann when the 
sets . and & are unions of residue classes. However, there is a remarkable theorem 
of Kneser [53] (see also Halberstam and Roth [30, Chap. 3]) which tells that this is 
essentially the only bad situation that happens. 

From Kneser’s theorem it is possible to deduce the following very general 
form of the local to global principle (Banks et al. [2]) which has many potential 
applications in additive number theory. 


Theorem 5. Suppose that there are numbers s,, 82 such that 


(i) For alls > s, andm,n €N, the sumset so has at least one element in the 
arithmetic progression n mod m; 
(ii) The sumset 82 has positive lower asymptotic density, i.e., d(s.2) > 0. 


Then, there is a number so with the property that for any s > S the sumset so 
contains all but finitely many natural numbers. 


Theorem 5 has several interesting consequences, such as the next theorem. 


Theorem 6. Let Y be a set of prime numbers with 


x ig 
lim inf ———— > 0. 
X00 X/logX 
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Suppose that there is a number s, such that for alls > s; and m,n € N, the 
congruence 


Pi +-+++ps =n (mod m) 


has a solution with p,,...,ps € Y. Then, there is a number so with the property 
that for any s = so the equation 


Pie Psa 


has a solution with p,,...,ps € FP for all but finitely many natural numbers N. 


10 The Large Sieve 


There is another sieve, the large sieve, which apparently, and certainly in its original 
manifestation, has nothing to do with the Goldbach problems. It was created by 
Linnik who had in mind a very important application to the least quadratic non- 
residue. 

Let the a, (M+ 1 <n < M+N) denote arbitrary complex numbers. In the 
application they may well be specialised to be the indicator function of some set. 
Let 


M+N 


Z(g:h)= Yo a 
n=M+1 
n=hmodq 


and for brevity write Z = Z(1; 1). Let #(p) be a set of residue classes modulo 
p which have the property that ifh ¢€ #(p), then Z(p;h) = 0, let p(p) = p — 
card. #(p) and consider 


Z 2 


. 
2 Ze Se) 


he F€ (p) 


By multiplying out this becomes 


lz? 
P— pip) 


G) 


Z\? 1 P 
h)|? = 
YW: dies p= >) 7 pee 


h=1 
where 


M+N 


S(a) = SS ane(an) 


n=M+1 
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and we have used the orthogonality of the additive characters modulo p. Note that 
S(1) = Z. Thus on rearranging this and summing over primes p < Q we find that 


(5). 


p(p) 
Ye Y pon] teh rE} 


PSQ he Z(p) ps<@Q p<Q a=1 


The sum on the right is bounded by 


M+N 


Ao(N, Q) lan|? 


n=M+1 


for some choice of Ag(N, Q) independent of the a,. If we take, as suggested above, 
the a, to be the indicator function of some set omitting p(p) residues classes modulo 
p for each prime p < Q we find that 


Ao(N, Q) 


Z< Ome 
= p 
dr<o Ppp) 
There are no restrictions on the size of p(p), which is the reason why any decent 
estimate for Ag(N, Q) is called the large sieve. In Linnik’s case p(p) = aa when 
p is odd. In this way he was able to show that for any fixed 6 > 0 the mumiber of 
primes p < x for which n(p) > p® is O(log log x). 
More generally an inequality of the kind 


M+N 
5 |s(a)2 < AW, 8) > lanl? 
r=1 n=M+1 


is called the large sieve. Here 


6 = mina, — al]. 
ry 


There was a considerable amount of work on this, beginning with seminal papers of 
Roth [79] and Bombieri [4] in the mid 1960s, and we now know that we can take 


A(N,6) =N-1+4+65"' 


and that this is best possible. There are two proofs, Montgomery and Vaughan [69] 
combined with an idea of Cohen, and Selberg [81]. Thus 


M+N 
yd s(¢ )I = (V-1402) > Jal? 
qsQ_ a=l n=M+1 


(a.q)=1 
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A further development was that Montgomery [31] was able to make use of the 
composite g, and obtained, again supposing that the a, are 1 or 0, 


N-1+@ 
2 p(p)_* 
yo Hq) Tota p—p(p) 


Z< 


Wait a minute! The sum in the denominator is the same as occurs in the Selberg 
sieve, and that Q? in the numerator looks cleaner to deal with than the remainders in 
Selberg’s sieve. The main drawback is that one has to work on an interval. In fact, 
it turns out that, in some sense, this is the dual of Selberg’s sieve. Anyway, this can 
certainly be applied to the Goldbach problem. For example, let NV be an even natural 
number and take a,, = 0 if p|n or p|N — n for any prime p < Q = VN. 

Moreover there is a lot more information to be extracted from the large sieve 
inequality. Let 


M+N 


SM = YS azn) 


n=M+1 


where x is a Dirichlet character modulo g. Then for primitive characters the 
relationship 


q 
De Ae(nx/q) = x(nyrQ. 
x=1 
where the Gauss sum 1 (7) satisfies t (7), gives 
M+N 


» Ae Y- Is@P <W-14+@) > lanl? 


qs Xmod q n=M+1 


* 
where a indicates that the sum is restricted to primitive characters. 


11 Bombieri 
In [4], Bombieri was able to use the large sieve to establish the following. Let 


d(y:q.a)= >) (logp). 


psy 
p=amodq 
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Theorem 7. Given any fixed A > 0 there is a B = B(A) such that if Q < 
x2 (log x), then 


y max sup 
a 


q<Q (a.q)=1 ysx 


U(y; g, a) — «K x(logx) 4. 


mal 
$(q) 


This is a very powerful result and can substitute for the Generalised Riemann 
Hypothesis in many applications. Independently and simultaneously a slightly 
weaker result was established by Vinogradov [91]. 

Apart from the application to Goldbach via the sieve mentioned in Sect. 8, there 
is another connection with Goldbach. Bombieri’s proof is quite technical, and first 
uses the large sieve to obtain bounds for Dirichlet polynomials. These in turn are 
used to obtain zero density estimates for the zeros of Dirichlet L-functions. Finally 
the zero density estimates are applied via the explicit formula for 


void = >> AM) x(n). 


ny 


We now have simpler proofs, and perhaps the simplest is via the identity of Sect. 5 
which gives a partition of sums involving the von Mangoldt function into good Type 
I and Type II sums [85]. Thus things have come full circle. A technique developed 
to treat the Goldbach problems is then used on a cognate question which then has 
implications back to Goldbach. 


12 Montgomery and Hooley 


We are not finished with the large sieve. Barban [3] states without proof a precise 
estimate for 


q 2 
me0=>)) >> (vera - =) 


qx a=l1 
(a,q)=1 


where 


Vesga= Yo A(n). 


nsx 
n=a mod q 


Davenport and Halberstam [17] then used the large sieve in a more sophisticated 
way to show that if x(logx)~4 < Q < x, then 


M>(x; Q) = O (Ox(log x)”) . 
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This was refined further by Gallagher who showed that 
M2(x; Q) = O(Qx(logx)) . 
Gallagher, inter alia, had given a very simple proof of a large sieve bound 
A(N, 8) <aN+Q? 


in which he compares 
ay+és 
sca,)Pwitn [|(B)Pap. 
a,—5 


When I first saw this, it was in Roth’s inaugural lecture at Imperial College in 1968, 
I got very excited because, if you think of the specialisation to the case when the a, 
are the rationals = this looks awfully like the Hardy—Littlewood method. 

In [66] Montgomery was able to give a more precise version, 


My(x; Q) = Oxlogx + O(QOxlog Q + x (log x)~“). 
The most remarkable thing about these results is that they give something that is 


beyond what can be deduced from the Generalised Riemann hypothesis. They are 
saying that on average 


Xx XxX 2 1 
Gay=—— = O01} a 2)). 
W(x 4,4) $@ ((:) (log q) ) 


Montgomery’s proof begins by multiplying out the left hand side. The terms 


q 
2° YO) vega 
; $(q) 


= 
(a.q)=1 


and 


2 iF 


sar Aes =i 


are easy to deal with. That leaves 


q 
Yd v@4aa”. 


q<Q a=1 
(aq=1 
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Since the residue classes with (a,q) > 1 contain very few prime powers this is 
essentially 


YY YM AMA 


qe<Qmsx nsx 
n=mmodq 


and this can be rewritten as 


OY > Anyr +2595 YO AM-DAM). 


nsx q<@Q I<x l<n<x 
ql 


Since the vast majority of prime power differences are even this is essentially 


OY Any’? +295 0 SO An = 2k) ACW). (15) 


n<x k<x/2 q|2k 2k<n<x 
qsQ 


Then Montgomery appeals to the result by Lavrik 


2 
> (> A(n) A(n — 2k) — (x— 26210 = O(x(logx)“), 
k< x n<x 


that mentioned in Sect.7, to show that the sum 


S> A(n = 2k) A(n) 


2k<n<x 


in (12) can be replaced by 
(x — 2k)G2(2k) 


with an acceptable error. The proof is then completed in a routine way. 

Thus Vinogradov’s method for dealing with the ternary Goldbach problem has 
been used to say something quite deep about the distribution of primes in arithmetic 
progressions. 

Soon afterwards Hooley [45] found a more elementary argument which gives 
even more precise results. However this is not the end of the Hardy—Littlewood— 
Vinogradov method. In 1996, Goldston and Vaughan have adapted that method 
to take things even further, and on the assumption of the Generalised Riemann 
hypothesis have obtained an essentially best possible error term. 
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13. Hooley and Third Moments 


Hooley [44] also considered the third moment 


qd 


3 
M3(x;2)= oq > (96100) - 3G 5) 


qsQ a= l 
(a.g=1 


and has shown for that, any positive constant A, if Q = o(x/ log x) as x > oo, then 


3 
M3 (x, Q) = 0(O?x log? x) +0 (=) 


and has also shown that if x/ logx < Q < x, then 


4 > 2x 
M3(x, Q, wo, Po) = ~~ O*x log? x + O ( Q?x log x log oO): 


2¢ Bi 


Later I was able to simplify the proof and obtain somewhat stronger results by using 
a variation on the main term which takes better account of the fact that when q is 
close to x there are relatively few primes in each residue class modulo q. 

Let me briefly describe the underlying method. We can concentrate on the terms 
with R < q < Q for some suitable R =, say, x(logx)~“. Then by considering the 
difference of the intervals Q < q < x and R < q < x and cubing out the general 
term gives four expression 


3 


x 
ae o(q) 


R<q<x 


3 s y D(x; 1.9055 @’ 


R<qsx i a=1 


qgj=1 


q 
3 > a B(x: g, a)?x, 


R<q<x a=1 
(a,q)=1 


> ¢@ > O(a: q,a)° 


R<qsx 
(a. D= 1 


for various choices of R. The first two are easily dealt and the third much as in the 
case of the second moment. The fourth can be rewritten as essentially 
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Y> ¢() >> — (logpi)(logp2) (log ps). 


R<q<@Q PISX,p2rSX,p3<x 
1=m=n card q 


Terms with p; = p,; for some j ¢ k can be dealt with similarly to the second and 
third sums. This leaves the case when the p; are distinct. This contributes 


oy ye 2 Gog p:)(log p2) (log ps). 


R<qsx j<x,k<x PiSx,prSx,p3<x 
qlj.q\k P3—P2=k. p2—pi=l 


Thus we need to examine the system 
P3 — P2 = 8, p2— Pi = A. 


These differences do not exceed x, whereas g > R. Hence s < x/R andr < x/R, 
and x/R = (log x)“. Moreover, solving for g, we have 


P3— P2 _ P2—P\ 
5 r 7 


in other words 
rp3 — (s+ r)p2 + sp; = 0. 


Again, this is a situation which can be treated successfully by the Hardy-Littlewood- 
Vinogradov method. 


14 Hooley’s Conjecture 


Let 


qd 


2 
ved= (Vina - =) | 


a=1 
(a,q)=1 


In view of the nature of the second moments, Hooley [43], has suggested that, for 
some unspecified range of g, 


Va, gq) ~ xlogg 
and in 1975 he showed that if 


x(logx) 4 <Q<x 
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then (12) holds for almost all g in the range g < q < Q and that on the generalised 
Riemann hypothesis the range (14) may be extended to 


4 
de 20 = x. 


Friedlander and Goldston [25] have shown that a better approximation to V(x; q) 
should be 


logp 
p-1 


U(x, q) = xlogqg—x| y+ log2m + x 
plq 


and have obtained various estimates for 


Y= 1V(x.g) — Ue. Q)I - 
g <qsO 
In Vaughan [87] the subject was developed further by treating the general moment 


My(x,0)= > |V@,q)-UG.Q)I. 


2 <q<O 


Once more the Hardy—Littlewood method plays a crucial réle. 


15 Small Gaps Between Primes 


The modern work on small gaps between primes starts with Hardy and Littlewood. 
In Sect. 4 it was mentioned that the seventh paper in their “Partitio Numerorum” 
series was never published. This deals with small gaps between prime numbers. Let 
2 = pi < p2 < ... be the sequence of primes in their natural order. The prime 
number theorem tells us that the average size of p,+1 — Pn is log py, So at once 


liming +! P* < 1, 
noo log Pn 


Hardy and Littlewood had the idea of considering 


A 1 A 
Y= |nhnatsh) =f [real e(a)Paa 


h=—H 0 h=1 
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where 
m(xh)= >) (logp:)(logp2) 
PIS, p2<x 
p2—pi=h 
and 


F(a) = } ‘(log p)e(ap). 


psx 


On the generalised Riemann hypothesis they were able to take 


2 
H= (5 +5) log x 


and deduce that the right hand side is so large that the left hand side contains non- 
zero terms with h ~ 0. Thus 


Erdés obtained an unconditional result and Rankin [75] and Ricci [76] both made 
contributions, but the first paper to make significant progress is Bombieri and 
Davenport [5] who used the large sieve and the Selberg sieve in the above method 
to obtain 


n —~ Pn 2 3 
inne oe + v3 


= 0.466. 
noo log Pn = 8 


This was successively reduced by Pilt’jai [72], Huxley [47, 48], and Maier [62] to 
0.248. 

The recent spectacular advances on small gaps by Goldston et al. [27], Zhang 
[94], Maynard [64], Tao are based around a rather different idea, that of seeking 
primes in “admissible” k-tuples of integers. Let 1p be the indicator function of the 
primes P. Typically an expression of the kind 


Ze (Steer +4) ~»)( ~ 2) (16) 


N<n<2N j=l d<R 
n=a (mod q) djn+h 
(d,q)=1 
is considered where, perhaps, d denotes the k-tuple dj,...,dk, djln + hj 


G = 1,...,k) and d = d,...d,. Other configurations are possible. Maybe 
d =d|(n+h,)...(n+h,). The idea is to use Selberg A which are related closely to 
those which would arise in applying his upper bound method to the prime k-tuples 
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conjecture. Since it is only an upper bound sieve method this does not ensure that 
each n + h; is prime, but it does make it more likely that primes occur amongst the 
n+hy,...,n + hx. If it can be show that the expression in (16) is positive then it 
will follow that there are n such that 


k 
S>Ip(n + hj) > p. 


j=l 


As of 25th May 2015, the best results achieved by these methods are that 
unconditionally 


lim inf Pa+1 — Pn < 246, 
noo 


and if one assumes the Elliott-Halberstam conjecture [19], then 


lim inf p41 — Pn < 12. 
noo 


This method has very great flexibility and many potential applications. One 
is to Dickson’s conjecture [1904] which states that if the g;, h; are integers and 
Me, (gjn + h;) has no fixed prime divisor, then there are infinitely many n such that 
the g;n + h; are simultaneously prime. Perhaps the most exciting application has 
been to large gaps, where Ford et al. [22] have solved a $5000.00 problem of Erdés 
by showing that there is a positive constant c such that for infinitely many n 


c(log n) (log log n) (log log log log n) 


n —Pn > 
cues log loglogn 


16 Goldbach Generalised: Partitions into Primes 


There are a variety of generalisations of Goldbach type problems which have been 
considered recently and to all of which one can apply the Hardy—Littlewood— 
Ramanujan—Vinogradov method. One such concerns the asymptotic formula for the 
number p(n) of partitions of n into primes. Let 


2) => p@z=[[a-2)"' 
n=0 Pp 


and 


sed kp 
YO) =) De 


k=1 p 
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which are readily seen to converge when |z| < 1 and satisfy 
P(z) = exp (W(z)). 

Moreover, for every k € N* and p € (0, 1), 

Ww (p) > 0 
and 

W®(p)>0o as pol. 

Thus, for every x > 0 the equation 

pW'(p) =x 


has a unique solution p = p(x) with p € [0, 1), and p(x) > 17 as x > oo. Thus 


as x > 00, 
1 log! 
xlog — = 7 ,/—— (14 ~2°8" + o(1/logy) J, 
p(x) 3 log x log x 


log | 
W(o(x)) = 2,/— (1 jes 0(1/10g.)) 
3 log x log x 


and the prime number theorem gives 


2 
W(p) = (o<) W(p) = 227 !(3 log x) 2x? (1 + O(log log x/ logx)). 


Then a variant of the method [88] will give the following theorem. 


Theorem 8. Suppose that n is sufficiently large. Then 


_ pln)" (o(n)) 
21Wy (p(n) 


p(n) (1+ 0(n-"*)). 


Theorem 9. We have 


p(n) 


p(n + 1) — p(n) ~ aalean 


as n > &. 
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17. Goldbach Generalised: Beatty Primes 


Another collection of recent applications concerns representations by Beatty primes. 
A Beatty prime is a prime in the set 


Bia, B) = {Lan+ Bl ine N} (17) 
where a, 8 € Rand a > 1. Thus, given a,...,a5, By,..., Bs € R with a; > | let 
R(n; ,B) = )° (log pi)... (log ps) (18) 

Pp 


* 
where ys indicates that the sum is over s-tuples of primes p; € A(a;, B;) with 


Pp 
Pi +++ Ds =n. 
See Banks et al. [1], Kumchev [54] and Vaughan [90]. Here is a sample theorem 
taken from the latter paper. 


Theorem 10. Suppose thats > 3,a; > 1(1 <j < 8), a,...,@s are irrational 
and there is a pair i, j such that a, a, | are linearly independent over Q, andn = s 
(mod 2). Then 


: ni! &,(n) 
R,(n; a, B) ~ et aN (19) 


asn > C&O 


18 Goldbach Generalised: Mixed Powers 


In addition to the Goldbach—Waring problem mentioned in Sect.5, there is a 
considerable body on mixed powers. One modern development has been to combine 
the Hardy—Littlewood method with sieve techniques. This was first done by Heath— 
Brown [41] when he showed that there are infinitely many configurations of three 
primes and an almost prime in arithmetic progression. Of course, this has been 
overtaken by the recent work of Green and Tao [29] (see also Green [28]), which 
itself is partially dependent on techniques which can be considered to descend from 
the Hardy—Littlewood—Ramanujan method. 

Let Y;, denote the set of natural numbers having at most k prime factors. Then 
the elements of Y; are called almost primes. In the context of the Goldbach—Waring 
problem the combination of the Hardy—Littlewood method with sieve techniques 
was first exploited systematically by Briidern in his Habilitationsschrift [6] and then 
by Briidern and Fouvry [9]. They proved that every sufficiently large integer n = 4 
(mod 24) can be written as the sum of four squares of P34-numbers. 
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This is an area which has really mushroomed and I am going to be very selective. 
To set the scene, consider Roth [78] who proved that if n is sufficiently large, 
then the equation 


ppt tpp=n (20) 


has solutions in primes p,...,p7 and a positive integer x. By comparison, if we 
insist the all the variables be prime, then the best that can be achieved is nine terms 
rather than eight. Briidern showed that, when n = 4 (mod 18), x can be restricted 
to be a P4-number. Then Kawada [50] replaced the Py by a P3. There is a good 
account of this and similar questions in Briidern and Kawada [10], Kawada [51] and 
Kawada [52]. 

Another class of questions that Briidern and Kawada consider is to show that in 
each case for almost all n such that 


n#0 (mod2), n=xr+pit+ps xe As (21) 
n#2(mod3), n=x+pi+ps xEeR (22) 
n#2(mod3), n=p+y+ps ye R (23) 
n#5 (mod7), n=x+pi+p, xe FP; (24) 
n#5 (mod7), n=pi+y+p, ye PY (25) 


there is a solution. Yet another is to show in each case that for 


n> no, neven, n=pptx+pit+p4 x€ P; (26) 
n > No, neven, n=ppt+x+p3+p% xE Ps (27) 
n> No, n even, n=x+pr+pitpi x€E Po (28) 


there is a solution. 
Recently Liu [60] has shown in each case if 


n > no, n odd, n=xt+pt+pitpitpi x€ Po (29) 
n > no, n odd, n=pitptptpty ye PY; (30) 
n> no, n odd, n=pitpatpit+yitys ye Po (31) 


then there is a solution. 
A different approach which works in some cases is to assume the Generalised 
Riemann Hypothesis. Thus in Vaughan [89] the following were obtained. 
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Theorem 11. Assume the Generalised Riemann Hypothesis. Then every sufficiently 
large even integer is the sum of a prime, the square of a prime and two cubes of 
primes. 


Theorem 12. Assume the Generalised Riemann Hypothesis. Then every sufficiently 
large odd integer is the sum of a prime and four cubes of primes. 


19 Goldbach Generalised: Other Rings 


Another rather obvious generalisation of the Goldbach questions is to other rings 
than Z. The irreducible polynomials play the réle of prime numbers. There might 
be some difficulties if one does not have uniqueness of factorisation. This is not 
a problem for polynomials over a field. It can be a lot easier than the classical 
case, but can nevertheless give rise to some interesting techniques. Thus Hayes 
[40] established a Goldbach theorem for polynomials over Z and there are many 
generalisations. See Pollack [73] for a generalization and a useful review of the 
literature. 

For polynomials over finite fields Effinger and Hayes [20] give a systematic 
treatment of the subject. There are many similarities with the classical situation. 
Thus it is possible to imitate the Hardy—Littlewoo—Ramanujan—Vinogradov method 
by introducing the Pontryagin dual. This gives an analogue of the fourier transform 
on the torus which is at the heart of the classical approach. 
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The Hodge Conjecture 


Claire Voisin 


Abstract This is an introduction to the Hodge conjecture, which, although intended 
to a general mathematical audience, assumes some knowledge of topology and 
complex geometry. The emphasis will be put on the importance of the notion of 
Hodge structure in complex algebraic geometry. 


1 Introduction 


The Hodge conjecture stands between algebraic geometry and complex geometry. 
It relates data coming from topology (a Betti cohomology class), complex geometry 
(the Hodge decomposition or filtration) and algebraic geometry (the algebraic 
subvarieties of a complex algebraic variety). We can state it very quickly by 
saying that it provides a conjectural characterization of algebraic classes, that is 
cohomology classes generated over Q by classes of algebraic subvarieties of a given 
dimension of a complex projective manifold X, as Hodge classes, that is those 
rational cohomology classes of degree 2k which admit de Rham representatives 
which are closed forms of type (k, k) for the complex structure on X. The geometry 
behind this condition is the fact that the integration current defined by a complex 
submanifold of dimension n — k annihilates forms of type (p,q) with (p,q) 4 
(n—k,n—k). 

Not much is known about the Hodge conjecture, apart from the Lefschetz 
theorem on (1,1)-classes (Theorem 2) and a beautiful evidence (Theorem 6) 
provided by Cattani, Deligne and Kaplan, which says roughly that Hodge classes 
behave in family as if they were algebraic, that is, satisfied the Hodge conjecture. 
What we plan to do is to explain the basic notions in Hodge theory (Hodge structure, 
coniveau) giving a strong motivation for the Hodge conjecture (and still more 
for its generalization, the generalized Hodge conjecture, see Conjecture 3). The 
Hodge structures on rational cohomology groups are very rich objects associated 
to a smooth projective complex variety, and the belief is that they carry a lot 
of qualitative information on the variety: Torelli theorems state that under some 
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assumptions, the isomorphism class of these Hodge structures determine the variety 
itself. In another direction, the Hodge conjecture is part of a general picture 
predicting that these Hodge structures determine the “motive” or at least the Chow 
groups of the variety. We also wish to present some of the most important facts 
from Hodge theory allowing to prove some conditional results or implications 
between various subconjectures. Some very important cases of the Hodge conjecture 
are summarized under the name of standard conjectures (see [21]), the main one 
being the Lefschetz standard conjecture (Conjecture 2). These instances of the 
Hodge conjecture concern Hodge classes of a very special type, which satisfy extra 
arithmetic conditions (being absolute Hodge, see Definition 5) satisfied by algebraic 
classes but not known to be satisfied by all Hodge classes (see Conjecture 7). An 
example of such conditional statement underlining the importance of the Lefschetz 
standard conjecture is Theorem 7 concerning the variational form of the Hodge 
conjecture which asks whether, starting from a variety X with a Hodge class a 
which is algebraic, and deforming X in a family (X;),eg in such a way that the class 
a remains Hodge along the deformation, the class a, also remains algebraic on X;. 

This paper is organized as follows: in Sect. 2, we will define Hodge structures, 
polarizations on them and Hodge classes. In Sect.3, we will present the Hodge 
conjecture, its generalized version, and the few cases in which it is known. We will 
also discuss the standard conjectures. Finally we will turn in Sect. 4 to variational 
aspects of the Hodge conjecture. Sections 3 and 4 use in an essential way the theory 
of mixed Hodge structures which is summarized in Sect. 3.3. 

This quick presentation of the Hodge conjecture does not contain many exam- 
ples. It is an invitation to read the book [23] where many specific known cases of 
the Hodge conjecture are presented. 


2 Hodge Structures, Hodge Classes 


2.1 Hodge Decomposition 


Let X be a complex manifold. The complex structure on X allows to decompose the 
vector bundle of complex differential 1-forms on X as 


Qxe = 22° @ QP’, (1) 


where oF is the vector bundle of 1-forms which are C-linear for the complex struc- 
ture on Tx, locally generated by dz;, the z;’s being local holomorphic coordinates, 


and a a a is its complex conjugate, locally generated by dz;. From (1), we 
deduce a decomposition of the sheaf of C°° complex differential forms of degree k: 


Dy 6 = Bp qr", (2) 
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where %," is the sheaf of differential forms of type (p, g), which can be written in 
local holomorphic coordinates z; as 


a= > ay ydzz A dzj. (3) 
\=p.|Jl=¢ 


It is clear from (3) that the exterior differential d : Be > a satisfies day a 


ae @ Ap 4+! There is thus no reason that the decomposition (2) induces a 
decomposition on the level of de Rham cohomology, that is on the space 


Ker (d : Ak(X) > A+!(Xx)) 


k = 
Hl (X, C) — Im (d : AK-!(X) — A*(X)) ; 


Here A‘(X) := I'(X, ayo) is the space of C° complex differential k-forms on 
X. However, when X is compact Kahler (and a fortiori projective), the Hodge 
decomposition theorem says the following: 


Theorem 1 (Hodge [17]). If X is a compact Kahler manifold, one has a canonical 
decomposition 

FE(X,C) = Opt ganH?4(X), (4) 
where H?4(X) is the set of de Rham cohomology classes of closed differential forms 
on X which are of type (p, q). 


The simplest consequence of this statement is the following restriction on the 
topology of compact Kahler manifolds: 


Corollary 1. [fk is odd, and X is a compact Kéhler manifold, b,(X) is even. 
Indeed, the definition we gave of H?4(X) clearly shows that the Hodge decomposi- 
tion (4) satisfies the Hodge symmetry property: 

HP-4(X) = H4?(X), (5) 


where complex conjugation acts naturally on H*(X,C) = H*(X,R) @ C. The 
conclusion of Corollary | is not satisfied by the simplest example of non-Kéhler 
compact complex surface, namely the Hopf surface S, which is the quotient of 
C? \ {0} by the action of Z given by multiplication by A # 0, where |A| 4 1. 
Indeed, 7, (T) = Z hence bi (T) = 1. 

Note on the other hand that by the change of coefficients theorem, we have 


H*(X,C) = H*(X,Q) @C. 


This leads us to introduce the basic definition of a Hodge structure of weight k: 
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Definition 1. A rational Hodge structure of weight k is the data of a finite rank 
Q-vector space L, together with a decomposition 


Lo :=L@eC= Bp+qakL", (6) 


where the ZL’? C Lc are complex vector subspaces satisfying the Hodge symmetry 
condition L?4 = L1?, 


The data of the Hodge decomposition (6) is equivalent to that of the Hodge filtration 
(which is a decreasing filtration on Lc) 


F'Lg := Dptgetgel (7) 


since LP? = FPLe N F4L¢. 

Hodge structures coming from geometry are “effective”, meaning that L’4 = 0 
for p < 0 or gq < 0. However it is natural to introduce the dual (L*, (L?:7)*) of a 
Hodge structure (L, L’’) of weight k and to give it weight —k, so the effectivity 
condition should not be part of the definition. 

Morphisms of Hodge structures (L, L?%) of weight k and (L’, L’”“) of weight k’ 
are defined only when k’ = k + 2r, as the set of morphisms ¢ : L > L’ of Q-vector 
spaces satisfying 


oc(L?) Cc petra 
The Tate twist L(r) of a Hodge structure of weight k is the Hodge structure L’ of 
weight k — 2r which has the same underlying vector space L’ = L and Hodge 
decomposition L’?4 = L?+"4*+", If X is a compact Kahler manifold, Poincaré duality 
provides an isomorphism of weight —k Hodge structures 
HE(X,Q)* = H"*(X,Q)(n). 
If X and Y are compact Kahler manifolds and @ : X — Y is a holomorphic map, 
o* : HY(Y,Q > H(X,Q 
is a morphism of Hodge structures since the pull-back by ¢ of a closed form of 
type (p,q) on X is a closed form of type (p, g) on Y; by Poincaré duality, the Gysin 
morphism 


o» | H*(X,Q) > H*t?*"(Y,Q), r= dim Y — dimX 


is also a morphism of Hodge structures. 
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2.2 Hodge Structures and Polarizations 


Given a morphism of Hodge structures ¢ : L —> L’, it is obvious how to define 
a Hodge structure on Ker ¢@ and on Im ¢ since morphisms of Hodge structures are 
those which are bigraded after tensoring by C. Hence rational Hodge structures form 
an abelian category. However this category is not semi-simple. This phenomenon 
already appears for weight 1 Hodge structures. An effective weight 1 Hodge 
structure on L is determined by the choice of vector subspace L''° C Le which has 
to be in direct sum with its complex conjugate. Suppose now that (L, L!°) contains 
a Hodge substructure L’ c L, L’ — Le NL!°. The only condition on the space L!\° 
determining the Hodge structure on L is that its intersection with L; has dimension 
$dim L/ . We claim that for a general pair (L’, L) of Hodge structures as above, 
there is no splitting L = L’ @ L” as Hodge structures. Indeed, there are countably 
many choices of such splitting over Q, and for a given splitting, the condition that 
L" C Lis also a Hodge structure means that L!° M L% has dimension 5dim L" . 
The complex dimension of the algebraic subset of the Grassmannian Grass(k, 2k) 
parameterizing the Hodge structures on L for which L’ C L is a Hodge substructure 
is thus equal to k’ —s (k — k’)k while the algebraic subset of the Grassmannian 
Grass(k, 2k) parameterizing the Hodge structures on L for which L is a direct sum 
L' ® L” of Hodge structures is a countable union of algebraic subsets of dimension 
k? + (k—K)*. As kk? + (k—K)k > Kk? + (k— J’, the claim is proved. The 
phenomenon described above does not appear in algebraic geometry where the 
Hodge structures we get are polarized. 


Definition 2. A polarization on a rational Hodge structure L of weight k is a 
nondegenerate intersection form (, ) on L which is symmetric if k is even, skew- 
symmetric if k is odd and satisfies the Hodge-Riemann bilinear relations: 


(1) (a, B) = 0 fora € LP, B EL?” and (p,q) £ (p’.). 
(2) uk(—1)?(a@,@) > 0 fora € L?4,a £0. 


Admittedly, the sign rules in (2) are complicated, but they are imposed on us by 
geometry. The importance of the notion comes from the following: 


Lemma 1. Let L’ C L be a Hodge substructure of a polarized rational Hodge 
structure. Then there exists a Hodge substructure L" C L such that L is isomorphic 
to L' ® L" as Hodge structure. 


Proof. Indeed, let g be the intersection form giving the polarization on L. It suffices 
to prove that the restricted form gz, is nondegenerate since then the orthogonal 
complement L” := L’ +9 is defined over Q, is a Hodge substructure of L by property 
(1) above and satisfies L’ @ L"” = L. Let h(u,v) = c*g(u,d) be the Hermitian 
bilinear form on Le associated to q. It suffices to show that /y;/, is nondegenerate. 
But the Hodge decomposition of L¢ is orthogonal for h by (1) above and each hyz/7.4 
is nondegenerate by (2) above. Hence h);/,, is nondegenerate. Oo 
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The construction of a polarization on the Hodge structure on H*(X,Q) when X is 
a smooth complex projective manifold goes as follows: Let 1 €¢ H?(X,Q) be the 
Chern class of an ample line bundle on X. Then the hard Lefschetz theorem [31, 
6.2.3] gives an isomorphism of Hodge structures 


I'-* —: HEX, Q) > A?"*(X,Q), n= dimX. 


We can thus assume k < n. We then consider the nondegenerate intersection pairing 


(a, B)) = ‘ia. Yar B,a, B € H*(X,Q). 


It is nondegenerate by the hard Lefschetz theorem, does not satisfy property (2) 
above, but satisfies property (1) above. We finally modify it as follows: the Hodge 
structure H*(X,Q) admits the Lefschetz decomposition as a direct sum of Hodge 
substructures 


H*(X,Q) = @2r<l” ~ H**"(X, Doprim; (8) 


where the primitive cohomology is defined by H!~?"(X, Q) prim = Ker (U'-**2"+1 
H**"(X,Q) > H?"-*+r+2(X_Q)). This decomposition is orthogonal for (, )/. 
The polarization (, ) on H*(X,Q) is the unique intersection pairing for which 
the Lefschetz decomposition is orthogonal, and which is equal to (—1)’(, ); on 
I’ ~ He2r (X, Q)prim. The fact that this polarizes (up to a sign) the Hodge structure 
on H*(X, Q) is exactly the contents of the Hodge-Riemann bilinear relations (see 
[31, 6.3.2]). 


2.3 Hodge Classes and Cycle Classes 
2.3.1 Hodge Classes 


Let H be a Hodge structure of even weight 2k, with Hodge decomposition 
Ac = ®p+q=2%H4. 


Definition 3. The Hodge classes in H are the classes in H (hence rational) which, 
via the inclusion H C He, belong to H**. 


We will denote Hdg”*(H) the space H N H* of Hodge classes. Note that this space 
can be reduced to 0 and will be 0 for a general Hodge structure with given Hodge 
numbers h?? = dimH?4 unless h’4 = 0 for p # q, since the space H** C He 
needs not be defined over Q, but only over R (as implied by the Hodge symmetry 
property, that is condition (5)). If X is a smooth projective variety, we will denote 
Hdg”*(X) the space Hdg”*(H?*(X, Q)). 
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2.3.2 Cycle Classes 


Let X be a smooth complex projective or compact Kahler variety of (complex) 


dimension n, and let Z s X be aclosed analytic subset (which in the projective case 
is the same thing according to Chow as a closed algebraic subset) of codimension k. 
If Z is smooth, then Z is a codimension 2k real submanifold endowed with the 
complex orientation, so it has a fundamental homology class [Z] jung € Hon—2%(Z, Z) 
which gives a homology class 


Jx [Z] fund € Aon—2K (X, Z) = H**(X, Z), 


where the last isomorphism is the Poincaré duality isomorphism. If Z is not smooth, 
then according to Hironaka, one can construct a smooth projective variety Z witha 
morphism Tt : ZZ of degree 1. Letting j =f OL: Z — X, we can define the 
class [Z] of Z by 


[Z] = ju [Zl jena € H?*(X, Z). 
Lemma 2. The class of a closed analytic subset Z in a compact Kahler manifold X 


is a Hodge class. 


Proof. Let n be the dimension of X. Then we have Poincaré duality 
H** (X, C) = Hy, eg 


identifying the space H**(X) with the subspace of H7*(X, C) which is orthogonal to 
Bp+q=2n—2k,(p.q)#(n—kn—ky HH? 4(X). It thus suffices to show that for B ¢ H?-4(X), p+ 
q = 2n— 2k, (p,q) # (n—k,n —k), one has ([Z], B)x = 0. Recall from Sect. 2.1 
that H?-4(X) consists of classes of closed forms of type (p,q). The class 6 is thus 
represented by a closed form B which is closed of type (p,q) and introducing a 
desingularization j : Z — X of Z, we have, by definition of the Gysin morphism, 


((Z1, B)x = Ge(Zlpintr Bx = (Zuma F* By = [F A. 


The last expression vanishes since the form j* B vanishes on Z for type reasons. 
Important examples of Hodge classes are provided by the following Lemma 3. 


Lemma 3. Let H, H' be two Hodge classes of weights k,k’ = k + 2r. Then 
the Hodge classes of the weight 2r Hodge structure Hom (H, H’) are exactly the 
morphisms of Hodge structures H — H. 


Here the Hodge structure on H* has been introduced previously, and the Hodge 
structure on the tensor product H* @ H’ = Hom (H, H’) is given by 


(H™ ® Hye = Br+=p, s+s/= =,(H* a *®@ (H')™ s (9) 
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Proof. Indeed a morphism ¢ € Hom(H,H’) = H* ® H’ is of type (r,r) for 
the tensor product Hodge structure if and only if it satisfies Pc € ®is)(H*)"* @ 
(H’)"*’—*, As we have (H*)'* = (H~"~*)*, this is equivalent to 


bc € is) (H")* @ (A’yterts = ®i,s) Hom (H"’, yrs), 


that is, to the fact that dc shifts the Hodge decomposition by (r, r). 


3 The Hodge and Generalized Hodge Conjectures 


3.1 The Hodge Conjecture 


Conjecture 1 (Hodge 1951). Let X be a projective complex manifold. Then for any 
k, the space Hdg**(X) is generated over Q by classes [Z] of codimension k closed 
algebraic subsets of X. 


A codimension k cycle on X is a formal combination Z = ; a;Z;, a; € Q. We 
will call cycle classes [Z] := )°,a;[Z;] algebraic classes, and will use the notation 
H*(X, Q)alg for the space of algebraic classes. We have H**(X,Q)aig C Hdg”*(X) 
and the Hodge conjecture states that H**(X,Q)aig = Hdg”*(X). 


3.1.1 Why Is the Conjecture Important? 


There are very few morphisms in algebraic geometry so it is important to consider 
multivalued morphisms which are given by their graphs I” C X x Y. This leads 
to consider the group &”(X x Y) of codimension m cycles in X x Y, or better 
cycles modulo an adequate equivalence relation ~, like rational equivalence, which 
provides Chow groups, or homological equivalence. When X and Y are smooth and 
projective, cycles in X x Y act on many objects, like Chow groups or cohomology. 
Given an adequate equivalence relation ~ on cycles, the action of F ¢ &"(X x Y) 
takes the general form 


P*(a) = pris (I - prga) e BEM 4M (X)/ w, Wa e B*(X)/ ~, 


where prj« is pushforward by the first projection, pr> is pull-back by the second 
projection and “.” is the intersection product. When the equivalence relation is 
homological equivalence, cycles Z mod. ~ are cohomology classes and the push- 
forward map is the Gysin map, the intersection product is the cup-product. 

The importance of the Hodge conjecture in this context is that, combined with 
Lemma 3, it predicts exactly which morphisms Z* : H*(Y,Q) > A*(X,Q) can 
be constructed from cycle classes in X x Y. Namely, one should get exactly the 
morphisms of Hodge structures. The geometric importance of this prediction is 
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obvious: we mentioned in the introduction Torelli type questions, asking whether 
a variety is determined by its Hodge structures. The Hodge conjecture predicts that 
if two smooth projective varieties X, Y have isomorphic Hodge structures, they are 
related by algebraic cycles in X x Y inducing isomorphisms in cohomology. In a 
more motivic direction, the Hodge conjecture can thus pedantically rephrased by 
saying that the category of polarizable Hodge structures contains the category of 
cohomological motives as a full subcategory, so that structure results for the category 
of polarizable Hodge structures (like semisimplicity, see Lemma 1) also should hold 
for the category of cohomological motives. This adequation of Hodge theory and 
algebraic geometry fits also very well with conjectures of Bloch and Beilinson (see 
[6, 19, 33]) predicting that to a large extent, Hodge structures control Chow groups. 
In our mind however, the generalized Hodge conjecture which will be explained in 
Sect. 3.3 is much more important in this context than the Hodge conjecture itself as 
it says much more, qualitatively, on the relationship between Hodge structures and 
algebraic cycles than the Hodge conjecture does. 

A more technical but important justification of the interest of the Hodge 
conjecture concerns the Hodge classes which appear in the standard conjecture. 
Roughly speaking, these Hodge classes are those which can be produced by linear 
algebra starting from classes of algebraic cycles. The classes so obtained, which will 
be described in Sect. 3.2, are still Hodge classes for linear algebra reasons, but it is 
not known if they are algebraic. The importance of these classes also comes from 
the consideration of the theory of motives. 


3.1.2 Positive Evidences 


The only instances of the Hodge conjecture which are known for any smooth 
complex projective n-fold X are first of all the two trivial cases H°(X,Q) = 
Hdg®(X,Q) = Q|X] fund, (where X is assumed to be connected), and H?"(X,Q) = 
Hdg?"(X,Q) = QIpoint], and secondly the Lefschetz theorem on (1, 1)-classes 
(Theorem 2) which concerns divisor (that is degree 2) classes and its corollary which 
concerns curve (that is degree 2n — 2) classes. 


Theorem 2 (Degree 2). Let X be a complex projective manifold and let 
a € Hdg*(X, Z) be an integral Hodge class. Then a is a combination with integral 
coefficients of classes [D] € H?(X,Z) of hypersurfaces D C X. 


Corollary 2 (Degree 2n — 2). Let X be a complex projective n-fold and let 
ae Hdg”"?(X) be a Hodge class. Then a is a combination with rational 
coefficients of classes [C] € H?"~?(X, Z) of curves C C X. 


Remark 1. The first three cases mentioned above (degrees 0, 2 or 2n) are the only 
cases where the Hodge conjecture is true for integral Hodge classes, that is integral 
cohomology classes whose image in rational cohomology is a Hodge class. This 
follows from Atiyah-Hirzebruch and Kollar counterexamples [3, 20] for integral 
Hodge classes. 
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Proof (Proof of Theorem 2). There is a beautiful description in [13] of the original 
Lefschetz proof. It relies on the notion of normal function associated to a Hodge 
class. Given a Hodge class a € Hdg?(X,Z), we choose a pencil of hyperplane 
sections (X;),<cp1 of X and assume that ay, = 0. The Hodge class @ lifts to a 
class @ in the Deligne cohomology group Ae (X, Z(1)) (see [31, 12.3.1]). Then ayy, 
belongs to 


Ker (Hz, (X;, Z(1)) > H?(X,,Z)) = J'(X,) = Pic®(X,). 


Associated to a we thus found a family of divisors t +> dx, € Pic®(X;). A large part 
of this argument works as well for any Hodge class on a smooth projective variety X 
vanishing on the fibers X; of a pencil on X. Indeed, the Deligne cohomology group 
He (X, Z(k)) fits in the exact sequence 


0 > J*(X) > H2(X, Z(k)) > Hdg”*(x, Z) > 0 


and similarly for X,;. We can thus lift a Hodge class on X to a Deligne cohomology 
class and restrict it to the fibers X,. The problem is that the normal function one shall 
get this way will be a holomorphic section of the family of intermediate Jacobians 
J*(X;);ep!, and one does not know for k > 2 what is the image of the Abel-Jacobi 
map 2*(X,)hom > J*(X,). 

The modern proof of Theorem 2 uses the exponential exact sequence and goes 
as follows: 


1) The Picard group of holomorphic line bundles of an analytic space X identifies 
to H lx , OY), where OF is the sheaf of invertible holomorphic functions. The 
exponential exact sequence 


(eS oe oes 
provides the associated cohomology long exact sequence 
.. H(X, Of) 4 H?(X,Z) > H?(X, Oy)... 
defining c,. 
If X is compact Kahler, the kernel of the natural map H?(X,Z) — H?(X, Gx) 


appearing above is exactly the set of integral Hodge classes. This follows from 
the fact that this map identifies using Hodge theory with the composite 


2 


wm 


H?(X,Z) ~5 H?(X,C) > H°2(X) = H°(X, Oy), 


where all maps are natural and the map H?(X,C) > H®°*(X) is the projection 
given by Hodge decomposition. It thus follows that a class w@ € H?(X, Z) which 
maps to 0 in H?(X, Gx) has a°? = 0 in the Hodge decomposition. But then it 
also has w?:° = 0 since it is real, and thus it is of type (1, 1) hence a Hodge class. 
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3) At this point we proved that if X is compact Kahler, the set of Hodge classes 
of degree 2 is equal to the set of classes c;(Z) where L runs through the set 
of holomorphic line bundles on X. Assume now that X is projective. By Serre 
GAGA principle [27], holomorphic line bundles and algebraic line bundles are 
the same objects on X : equivalently, any holomorphic line bundle has a nonzero 
meromorphic section. Choosing a nonzero meromorphic section o of L, we 
introduce its divisor D, which is a codimension | cycle on X and the final step 
is Lelong’s formula [31, Theorem 11.33] which says that the class [D,] is equal 
to c; (L). 


Proof (Proof of Corollary 2). We use for this the Lefschetz isomorphism 
I"? —: H?(X,Q) > H”"?(X,Q) 


given by the choice of a very ample line bundle & on X with first Chern class 1, 
which is obviously an isomorphism of Hodge structures. A Hodge class 6 of degree 
2n — 2 can thus be written as B = I"? ~ a, where a is a Hodge class of degree 2. 
The class a is the class of a divisor D = ar a;D;, where the D,’s are hypersurfaces 
in X, and thus B = )°,a;[C;] where the curve C; is the intersection of D; with 
a surface L} 1... Ly,-2 complete intersection of hypersurfaces Z; in the linear 
system |.Z| (hence of class /) in general position. 


Apart from these four known cases, the best positive evidence in favour of the 
Hodge conjecture is the fact that Hodge classes behave geometrically as if they 
were algebraic as predicted by the Hodge conjecture. The precise statement will be 
explained in Sect. 4.2. 


3.1.3 Negative Evidences 


Many complex geometry results have been proved in the past by analytic methods 
working as well in the compact Kahler setting, for example the Hodge decomposi- 
tion itself, or the study of positivity of divisors by curvature and currents methods 
[12], or the proof of the existence of Hermite-Einstein metrics on stable vector 
bundles [29]. In the case of the Hodge conjecture, it has been known for a long time 
(see [36]) that in the compact Kéhler setting, there are not enough closed analytic 
cycles to generate the Hodge classes: the example, due to Mumford, is a very general 
complex torus of dimension at least 2 admitting a holomorphic line bundle & with 
nontrivial Chern class which is neither positive not negative: such a torus does not 
contain any hypersurface, while c;() is a nontrivial Hodge class. However, in this 
example, one can argue that the problem is a lack of effectivity (or positivity), and 
that we still have a complex geometric object which is a good substitute for the 
hypersurfaces, namely the line bundle itself (in the projective case, by the existence 
of rational sections of line bundles, Chern classes of line bundles are combinations 
of classes of hypersurfaces). 
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In the paper [30], I constructed examples of Hodge classes on complex tori T, 
which do not belong to the Q-vector space generated by Chern classes of coherent 
sheaves on T. It seems that in these cases, there is no way of extending the Hodge 
conjecture: there is no holomorphic object on T explaining the presence of a Hodge 
class on T. 

The second point which makes not very plausible a solution of the Hodge conjec- 
ture by analytic methods is the lack of uniform solutions to the Hodge conjecture, 
assuming they exist, that is the lack of bound on the cycles (supposed minimal 
in some way) representing a given Hodge class. This follows from the analysis 
of some of the known counterexamples to the integral Hodge conjecture. In the 
case of Kollar counterexamples [20], which are just hypersurfaces X of degree d 
in projective space P”*! with the generator a of H?"~*(X, Z) not being algebraic 
while da is algebraic, it was observed in [28] that the following phenomenon holds: 
Let U be the Zariski open set in the space of homogeneous polynomials of degree 
d such that the corresponding hypersurface is smooth. Then the (locally constant) 
class a, € H?"~?(X;,Z) is Hodge on X; for any t € U, the set of points tr € U 
such that the class a; is algebraic on X; is dense in U for the usual topology, while 
Kollar proves that this set is not the whole of U. This means that for a very general 
point 0 € U, there is a sequence of points ¢, € U converging to 0 and for which 
the class a@;, is the class of an algebraic cycle Z, on X;,,. Thus the cycle Z, is of the 
form Z;* —Z;, but the degrees of the positive part Z* and the negative part Z, of Z, 
cannot be bounded, although the difference Z, has class a@,, which is locally constant 
hence bounded. Indeed, if these degrees were bounded, we could use compactness 
results to make the cycles Z* and Z, converge respectively to cycles Z* and Z~ on 
Xo with [Z*] — [Z~] = a, which is not true. 


3.2 The Standard Conjectures 


The main source of construction of Hodge classes is Lemma 3. Let X be a complex 
projective n-fold, and consider X x X. For any integer k, we have 


End H*(X,Q) = H?"-*(X, Q) @ H*(X, Q) C H7"(X x X, Q) 


and Lemma 3 tells us that a morphism ¢ € End H*(X,Q) provides a Hodge class 
on X x X by the composite map above if and only if ¢ is a morphism of Hodge 
structures. In particular, the identity of H‘(X, Q) is a morphism of Hodge structures, 
hence provides a Hodge class 6, € Hdg*”(XxX, Q). The sum >, 5; is the identity of 
H* (X, Q), hence is the class of the diagonal Ay C X x X. Hence }°, 5; is algebraic 
but it is not known if individually each class 6, is algebraic, that is, satisfies the 
Hodge conjecture. The classes 5; are called the Kiinneth components of the diagonal 
of X. The varieties for which it is known that the Kiinneth components of the 
diagonal are algebraic include the abelian varieties (that is, projective complex tori) 
and smooth complete intersections in projective space, for which the non-algebraic 
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cohomology is concentrated in degree n. If A is an abelian variety (or complex 
torus), A is an abelian group, hence we have for each / the multiplication map 


b:A—> A, ab la. 


We have uj = [Id on H*(A, Q) and it easily follows that we can write the Kiinneth 
components of A as linear combinations of the classes of the graph [7 of jy; for 
various / (note that y* = [I7]* : H*(A, Q) > H*(A, Q)). 

A more subtle construction involves the properties of the Lefschetz operator. 
Recall from Sect. 2.2 that if / is the first Chern class of an ample line bundle # on 
X, the cup-product map 


pk _: HEX, Q) > H”"*(X,Q), n = dimX (10) 


is an isomorphism for any k. It is clear that /’”~* ~ acting on H*(X, Q) is the action 
of the following cycle on X x X: let L},...,L,—-,% be general hypersurfaces in the 
linear system |_| (we may assume very ample), and let Z = L)N...ALy—~. Then 
[Z] = I’-* by Lelong’s theorem, and [i,.Z] ¢ H*”"~7*(X x X,Q) acts on H*(X, Q) 
by I’-* —, where i,4Z is the cycle Z supported on the diagonal Ay = X C Xx X. 
Next we can consider the inverse A,_, : H?”-*(X, Q) — H*(X, Q) of the Lefschetz 
isomorphism (10). This is a morphism of Hodge structures, hence this provides a 
Hodge class on X x X. 


Conjecture 2 (Lefschetz Standard Conjecture). There exists a codimension k 
cycle Z on X x X such that |Z]* : H?"-*(X, Q) > H*(X, Q) is equals to An—k. 


Again the answer is positive in the case of an abelian variety A, and this is due to 
the existence of an interesting line bundle Y on A x A, defined as 4* & where 
ju: AXA — Ais the sum map. The line bundle # is called the Poincaré divisor 
and its class p := c\(#) € Hdg?(A x A) and its powers p* € Hdg?*(A x A) are 
algebraic classes on A x A which allow to solve the Lefschetz conjecture in this case 
(see [24]). 

The Lefschetz standard conjecture is very important in the theory of motives (see 
[1]), because of the semisimplicity Lemma |. This lemma uses the polarization to 
construct, given a polarized Hodge structure L and a Hodge substructure L’ Cc L, a 
decomposition 


L=L@L". (11) 


The construction of these polarizations when L = H*(X,Q) for some smooth 
projective variety X is quite involved, as it uses the Lefschetz decomposition in 
order to modify the natural pairing into one which satisfies the polarization axioms. 
If now L = A*(X,Q) and L’ C L is defined as the image of a morphism [Z]* for 
some algebraic cycle Z on X x X, the Lefschetz standard conjecture is exactly what 
would be needed in order to construct the orthogonal complement L” via the action 
of an algebraic cycle on X x X. 
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The most concrete consequence of the Lefschetz standard conjecture is the 
following (cf. [21]): 


Lemma 4. Let X be a smooth complex projective variety of dimension n. Assume 
the Lefschetz standard conjecture holds for X and some ample class | € Hdg?(X) in 
all even degrees 2k. Then for any k, the intersection pairing between H**(X, Q)ale 
and H?"~**(X, Q)aig is nondegenerate. 


Proof. Indeed, if the Lefschetz conjecture holds for X in any even degree, then the 
Lefschetz isomorphism (10) induces an isomorphism /"~** ~: H*(X, Q)aig = 
H-**(X, Q)aig for all k < n/2, because the inverse A,-,; preserves algebraic 
classes. It follows that the space H**(X, Q)aig is stable under the Lefschetz decom- 
position (8). It suffices to prove that for k < n/2 the pairing (, ); on H7*(X, Q) 
defined by (a, 8); = (I"-** ~ a,B)x, is nondegenerate on H*(X,Q)aig C 
H**(X, Q). By the Hodge-Riemann bilinear relations, the Lefschetz decomposition 
is orthogonal for this pairing and on each piece 1” ~ H?*?"(X,R) prim, the pairing 
(, ); restricted to the subspace H*!~"*~"(X) pp prim C H7*?"(X, Q)prim of real classes 
of Hodge type (k — r,k — r) is definite of a sign which depends only on k — r. 
As I’ ~ He-""(X, Q)aig,prim 18 contained in HEX )p prims it follows that 
the pairing (, ); restricted to /” ~ HE" (Xx, Q)aig,prim Temains definite, and in 
particular nondegenerate. Hence (, ); is nondegenerate on Hk (X, Q)aig which is 
the orthogoanl direct sum of the spaces /”’ ~ H*"(X, Q)ale,prim: 


Let us give two corollaries: 


Corollary 3. (i) Let j : Y — X be a morphism, where X, Y are smooth complex 
projective varieties. Assume X and Y satisfy the Lefschetz standard conjecture. 
Then if Z is an algebraic cycle on Y whose class [Z] € H?*(Y,Q) is equal to 
iB for some class B € H7*(X,Q), there exists a codimension k cycle Z' on X 
such that 


F(Z] = [Z] in H*(Y, Q. (12) 


(ii) If Z is an algebraic cycle on X such that the class [Z| € H?*(X,Q) is equal 
to jxB for some class B € H**-?"(Y,Q), r = dimX — dim Y, there exists a 
codimension k — r cycle Z' on Y such that j{Z'| = [Z] in H**(X, Q). 


Proof. (i) The class B gives by the Poincaré pairing on X a linear form on 
HX, Q)aig, 2 = dim X, which by Lemma 4 applied to X is of the form 
({Z’], }x for some codimension k cycle Z’ on X. We now prove that the class 
[Z’] satisfies (12). By Lemma 4, it suffices to show that for any cycle W on Y, 


(*[Z’], [W])y = (IZ]. [W])y. (13) 


The left hand side is equal to ([Z’], jx[W])x where j is the inclusion morphism 
of Y in X, and by definition of [Z’], this is equal to (6,j.[W])x. Finally, by 
definition of the Gysin morphism j., we have (8, j.[W])x = (*B,[W])y = 
([Z]. [W])y. 
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(ii) is proved exactly in the same way. 


The following corollary appears in [33] where it is proved that the conclusion (for 
all X and Y) is essentially equivalent to the Lefschetz conjecture: 


Corollary 4 (See [33]). Assume the Lefschetz conjecture. Let X be a smooth 
projective variety and let Y C X be a closed algebraic subset. Let Z be a 
codimension k cycle on X whose cohomology class [Z| vanishes in H?*(X \ Y,Q). 
Then there exists an algebraic cycle Z’ supported on Y such that [Z| = [Z’] in 
H*(X, Q). 


Proof. Our assumption is that there is a homology class B € Hon—2%(Y,Q) such 
that the image of jxB € H2n—2%(X,Q) = H7*(X, Q) is equal to [Z]. We now apply 
Lemma 6, which says that if j :¥ > Xisa desingularization of Y, there exists a 
class f’ € H?*-2"(Y, Q) such that j.’ = [Z], where r = dimX — dim Y. We then 
conclude with Corollary 3, (11). 


3.3 Mixed Hodge Structures and the Generalized 
Hodge Conjecture 


In [8], Deligne discovered a very important generalization of Hodge structures, 
namely mixed Hodge structures, see [25]. The definition is as follows: 


Definition 4. A mixed Hodge structure is the data of a finite dimensional Q-vector 
space L equipped with an increasing exhaustive filtration W (the weight filtration), 
together with a decreasing exhaustive filtration F on Lc with the property that the 
induced filtration on Gri,, defined by F?Gri, = F? 0 WjLc/F? ON W;+1Lc, comes 
from a Hodge structure [see (7)] of weight i on Greg. 


Morphisms of mixed Hodge structures are morphisms of Q-vector spaces preserving 
both filtrations. The following result is crucial for geometric and topological 
applications of this notion. 


Lemma 5 (Deligne [8]).. Morphisms of mixed Hodge structures are strict for both 
filtrations. 


Denoting by ¢ : L — M sucha morphism, this means that 
(Im gc) N F?Mc = $c(F’Lc), Um) 1 WiMc = $(WiL). 


We will call the pure Hodge substructure of a mixed Hodge structure the smallest 
nonzero piece W;L C Land the pure quotient the quotient L/ WL where i is maximal 
such that W;L 4 L. they both carry a Hodge structure. 

Deligne proves the following result: 
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Theorem 3. For any quasiprojective variety X, its homology groups and cohomol- 
ogy groups carry mixed Hodge structures, which are functorial under pull-back on 
cohomology and functorial under pushforward on homology. 

If X is smooth, the pure Hodge substructure on H*(X,Q) has weight k (so all 
weights are > k) and is equal to Im(H*(X,Q) — H*(X,Q)) for any smooth 
projective compactification X of X. 

If X is projective, the pure quotient Hodge structure of H*(X,Q) has weight k (so 
all weights are < k) and is equal to Im (H*(X,Q) > HE (xX, Q)) for any smooth 
projective desingularization x of X. The dual statement is that the pure Hodge 
substructure of Hy(X,Q) is the image Im (Hy (X, Q) — A(X, Q)) for any smooth 
projective desingularization xX of X. 


Let now X be a smooth projective variety, and Y C X be a closed algebraic 
subset of X. Assume for simplicity that all the irreducible components of Y are 
of codimension r. 


Theorem 4. Let U := X \ Y. Then the kernel 
Ker (H*(X, Q) > H*(U, Q)) 


is a Hodge substructure Ly of H*(X,Q) which is of Hodge coniveau > r, meaning 
that Li; = 0 for p < rorg <r. 


Proof. We will use the following consequence of Theorem 3 and Lemma 5 which 
is of independent interest: 


Lemma 6. In the situation of Theorem 4, the kernel Ker (H*(X,Q) > H*(U,Q)) 
is equal to the image of the composite map 


Je: Hon-e(¥.Q) > Hon-1(X,Q) = H*(X, Q), (14) 


where j :¥>Xisa desingularization of Y. 


Proof. This kernel is the image of the composite map 


PD 
Hon—1(Y,Q) > Hon-4(X,Q) = H*(X, Q). 


This map is a morphism of mixed Hodge structures, the right hand side being a 
pure Hodge structure of weight k. Comparing weights and applying Lemma 5 and 
Theorem 3, the image of this map is the same as the image of the pure Hodge 
substructure of Ho,-,(Y,Q), that is Im (Hon—x(Y, Q) > Aon-x(Y,Q)), which 
concludes the proof. 


Of course, as Y is smooth and projective, the composite in (14) is the same as 
the Gysin morphism j, : H*"(Y, Q) > H*(X,Q). As jx is a morphism of Hodge 
structures of bidegree (r,r), its image is a substructure of H*(X,Q) which is of 
Hodge coniveau > r. 
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The generalized Hodge conjecture due to Grothendieck [15] states the following: 


Conjecture 3. Let X be a smooth complex projective variety and let L C H*(X, Q) 
be a Hodge substructure of Hodge coniveau > r. Then there exists a closed 
algebraic subset Y C X of codimension > r such that L C Ker (H*(X,Q) > 
H*(U,Q)), U:=X\ Y. 


The Hodge Conjecture | is the particular case of Conjecture 3 where k = 2r. Indeed, 
a Hodge substructure of H?"(X,Q) which is of Hodge coniveau > r is made of 
Hodge classes. Conjecture 3 predicts in this case that L vanishes away from a closed 
algebraic subset Y C X of codimension r, which is the same as saying that L is 
generated by classes of irreducible components of Y (see [31, 11.1.2]). Conjecture 3 
corrects an overoptimistic formulation of the Hodge conjecture (see [18]), where 
any rational cohomology class a of degree k with Hodge decomposition 


_ k-rr 


ac=a +...$ 07%, 


that is, satisfying a’? = 0 for p < rorgq < r, is conjectured to be supported on 
a codimension r closed algebraic subset. This is wrong by Theorem 4 which says 
that if @ is supported on a codimension r closed algebraic subset, then the minimal 
Hodge substructure L C H*(X, Q) containing @ also satisfies L?? = 0 for p < ror 
q <r (see [15], [32, Exercise 1 p 184]). 

The generalized Hodge Conjecture 3 cannot be deduced from the Hodge 
conjecture, unless the following conjecture is answered affirmatively: 


Conjecture 4. Let X be a smooth projective complex variety and let L C H*(X,Q) 
be a Hodge substructure of Hodge coniveau = r (thus L(r) is effective of weight 
k—2r). Then there exists a smooth projective variety Y, such that L(r) is isomorphic 
to a Hodge substructure of H'~*"(Y, Q). 


We now have: 


Proposition 1. Conjecture 4 combined with the Hodge conjecture implies Conjec- 
ture 3. 


Proof. Note that by the hard Lefschetz theorem, it suffices to prove Conjecture 3 
for L C H*(X,Q) with k < n. Next assume Conjecture 4. Then since k < n we 
can assume by the Lefschetz theorem on hyperplane section that dim Y = n — r. 
Now L(r) is a direct summand of H*~?"(Y, Q) and the Hodge structure isomorphism 
L(r) = L Cc H*(X,Q) provides by Lemma 3 a Hodge class a of degree 2n on 
Y x X. Assuming the Hodge conjecture, a is algebraic, which provides a cycle Z = 
>, aZ;, Z; C Y x X, dimZ; = n—r, such that L = Im({Z]x : Hk ?"(Y,Q) > 
H*(X,Q)). But then L vanishes away from the codimension > r closed algebraic 
subset Y’ := Ujpr2(Z;) of X. 


538 C. Voisin 


4 Variational Hodge Conjecture 


4.1 The Global Invariant Cycles Theorem 


The following result is due to Deligne [8]. Let @ : X — B be a holomorphic 
map from a smooth projective variety X to a connected complex manifold, and 
let f° : X° — B® be the restriction of @ over the open subset B° of B of regular 
values of @. By definition, 6° : X° — B® is proper with smooth fibers, hence is a 
topological fibration. There is thus a monodromy representation p : 7(B°,b) > 
Aut H*(X;, Q), where b € B® is a regular value. 


Theorem 5. The image of the restriction map H'(X,Q) > H*(Xp, Q) is equal to 
the subspace H*(X;,, Q)? of monodromy invariant cohomology classes. 


Proof (Sketch of Proof). The proof of this theorem splits into two parts. First of 
all, Deligne proves in [11] that the Leray spectral sequence for ¢° degenerates at 
E, a result which was also known to Blanchard [4]. This implies that the space 
H*(X;, Q), which is also the image of H®(B°, R'¢2Q) in H*(X;, Q), is equal to the 
image of the restriction map 


H*(X°,Q) > H*(Xp, Q). (15) 


The second step uses the full strength of Theorem 3. The morphism (15) is a 
morphism of mixed Hodge structures, the Hodge structure on the right being pure, 
that is, equal to its minimal Hodge substructure. The mixed Hodge structure on the 
left has for minimal Hodge substructure (or pure part) the image of the restriction 
map H*(X,Q) — H*(X°, Q). Comparing weights, it then follows from Lemma 5 
that the two restriction maps H‘(X°,Q) > H*(X,, Q) and H*(X, Q) > H*(X,, Q) 
have the same image. 


4.2 The Algebraicity Theorem and Application 
to the Variational Hodge Conjecture 


The following theorem proved in [7] is the best known evidence for the Hodge 
conjecture. It says that Hodge classes behave geometrically as if they were algebraic. 
Let 6 : 2 — B be a projective everywhere submersive morphism, with 2, B 
smooth quasi-projective. For any b € B, denote by 2% the fiber 6~!(b). Let a € 
Hdg**(.%,) be a Hodge class. The Hodge locus of a is defined as the set of points 
t € B, such that for some path y : [0,1] > B with y(0) = b, y(1) = +t, the class 
as € H**(.25,.),Q) remains a Hodge class for any s € [0, 1]. Here a, is the class 
a transported to 24s) using the natural isomorphism H**(.27,,Q) = H**( 24s), Q) 
given by topological trivialization of the pulled-back family 2), — [0, 1]. 
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Theorem 6 (Cattani, Deligne, Kaplan 1995). The Hodge locus of a is a countable 
union of closed algebraic subsets of B. 


Note that the local structure of this locus, say in an open ball B’ C B, as a countable 
union of closed analytic subsets of B’ was understood since the developments of the 
theory of variations of Hodge structures due to Griffiths [14]. The difficulty here 
lies in the comparison between the analytic and the algebraic category (the basis B 
is almost never projective in the above theorem). 

That this is indeed the structure predicted by the Hodge conjecture for the Hodge 
locus of a follows from the existence of relative Hilbert schemes (or Chow varieties) 
which are projective over B and parameterize subschemes (or effective cycles) 
Z,; C X, of a given cohomology class. Using these relative Hilbert schemes Mi, 
we can construct a countable union of varieties Mj projective over B, defined by 
Mi = M; Xz M; and parameterizing cycles Z; = va — Z; in the fibers .2;. For any 
point t € B, if the class a, on 2; is algebraic, a, is the class of a cycle ze —Z, 
parameterized by a point in the fiber of at least one of these varieties Mjj. Hence the 
Hodge locus is the union of the images of Mj in B over the pairs (i,j) such that the 
cycles parameterized by Mj; are of class a. 

Let us explain the importance of this theorem in the context of the “variational 
Hodge conjecture”. Here the situation is the following: 2 is acomplex manifold, A 
is a complex ball centered at 0, 2 — A is a proper submersive holomorphic map 
with projective fibers 2;, t € A, anda € H**(.2, Q) is a cohomology class which 
has the property that a, := ag, is a degree 2k Hodge class on 2; for any t € B. 


Conjecture 5 (Variational Hodge Conjecture). Assume that ao satisfies the 
Hodge conjecture, that is, is algebraic on 2o. Does it follow that a, is also 
algebraic? 


Theorem 7. The variational Hodge conjecture is implied by the Lefschetz conjec- 
ture. 


Proof. The family of projective varieties (.2%)yea is the pullback of an algebraic 
family 2°“% — B viaa holomorphic map f : A > B. Our assumption is that f(A) 
is contained in the Hodge locus By of the Hodge class ap on 20. By Theorem 6, this 
Hodge locus is algebraic, and we can thus replace A by an irreducible component Bi, 
of By passing through 0 and containing f(A). We can assume that B’, is smooth by 
desingularization. By definition of By, the class a, deduced by parallel transport 
from the class a is Hodge on all fibers 2; of the family 2;"° — Bi. The 
monodromy has finite orbits on the set of cohomology classes in fibers which are 
Hodge everywhere (see [35, Theorem 4.1]). Replacing B’, by a finite étale cover, we 
can thus assume that the class a@ is monodromy invariant on B’,. Let us introduce 


a smooth projective completion .2;"% of 2;"%. By Theorem 5, there exists a class 


B € H*(.22"8, Q) such that |. = &. We now apply Corollary 3 (i) toX = Rae, 
Y = 4. As the class a9 = jy, is algebraic, there exists assuming the Lefschetz 


standard conjecture a cycle Z on X'S such that [Z]| 2% = @o, hence [Z]| a, = a, 
Vt € AC B), and thus q, is also algebraic. 
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4.3 Algebraic de Rham Cohomology and Absolute Hodge 
Classes 


The following arithmetic counterpart of Theorem 6 is completely open except for 
abelian varieties [9] (see also [26, 34] for some partial results) : 


Conjecture 6. In the situation of Theorem 6, assume the family 2 — B is defined 
over a field K (in fact, we can always assume K to be a number field). Then the 
Hodge locus of a is a countable union of closed algebraic subsets of B which are 
defined over a finite extension of K. 


Using the global invariant cycle theorem, this conjecture would allow to reduce the 
Hodge conjecture to the case of varieties X defined over a number field (see [34]). 
It would be disproved by the existence of a variety X not defined over a number 
field, with a Hodge class a such that the pair (X, a) is rigid (meaning that under a 
nontrivial deformation of X, the class a does not remain Hodge). 

We next introduce the notion of absolute Hodge class. Let X be a smooth 
projective variety defined over C. In the following, we will write X“” for the complex 
manifold associated with X and cohomology on X will be coherent cohomology 
with respect to the Zariski topology on X. We have a chain of isomorphisms whose 
combination gives the Grothendieck comparison isomorphism [16]: 


HK (x, Qc) ~ K(x”, ea) ~ HE (Xx, C). 


The first term is algebraic de Rham cohomology of X over C. The second term is 
holomorphic de Rham cohomology of X“ and the first isomorphism comes from 
Serre’s GAGA theorem [27]. The second isomorphism comes from the fact that 
the holomorphic de Rham complex is a resolution of the constant sheaf C on X“". 
Note that the Grothendieck isomorphism gives an algebraic definition of the Hodge 
filtration, namely, it induces for any p an isomorphism 


HE (x, Qe) ~ FP HE (XC). (16) 


Let now t : C > C be a field automorphism. Clearly t induces an isomorphism 
(which is not C-linear) 


Ts AE (X, Qc) & HE (Xr, QE je), (17) 
where X; is the complex algebraic variety whose equations are obtained by applying 
tT to the coefficients of the defining equations of X. Composing this automorphism 
with the Grothendieck isomorphisms 


Hi (X, Qk jc) = H*(X",C) (18) 


for X and X,, we get an isomorphism H*(X%",C) = H*(X2",C), a@ + a;. This 
isomorphism is compatible with the Hodge filtrations by (16). 
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Definition 5. Let a be a degree 2k Hodge class on X. We say that @ is an 
absolute Hodge class if the class (2iz)*a =: a’ has the property that for any field 
automorphism t of C, a! belongs to (2u7)*H7*(X2", Q). 


Remark 2. The class @/ is then (2i7)* times a Hodge class on X,, as it belongs to 
F*H?*(X2",C) since a belongs to F*H7*(X4", C). 


We now use the existence of an algebraic cycle class Z +> [Zap] with value in 
algebraic de Rham cohomology (see [5] for an explicit construction). It is clear that 
if t is a field automorphism of C, and Z is a codimension k algebraic cycle on X, 


tx(Z]ar = [Zelar in H(X,, 25, jc); 


where Z, is the cycle of X; obtained by applying t to the defining equations of the 
components Z; of Z. Finally we use the comparison formula saying that, via the 
Grothendieck isomorphism (18), [Z]ar = (2usr)*[Z]. We then get: 


Proposition 2. Cycle classes on smooth projective varieties are absolute Hodge. 


Conjecture 6 is a weak form (see [34]) of the following Conjecture 7 (which by 
Proposition 2 is part of the Hodge conjecture). 


Conjecture 7. Hodge classes are absolute Hodge. 


Deligne [9] proves Conjecture 7 for abelian varieties. It follows from the 
compatibility properties of the Kuga-Satake construction [22] (see [10]) that it 
is true as for Hodge classes on (powers of) hyper-Kahler manifolds lying in the 
subalgebra generated by H?. 

In general, one can say from the above discussion that the Hodge conjecture has 
two independent parts, each of which might be true or wrong, namely Conjecture 7 
on the one hand and on the other hand the conjecture that absolute Hodge classes 
are algebraic, which is in the same spirit as the Lefschetz standard Conjecture 2 but 
also concerns more mysterious classes, like Weil classes on abelian varieties with 
complex multiplication. 

Let us conclude with an example of an absolute Hodge class which is not 
known to be “motivated” in the sense of André [2]. André defines the set of 
motivated classes as the smallest set of classes on smooth projective algebraic 
varieties containing algebraic classes, and stable under the operators A, inverse of 
the Lefschetz operators and under any other algebraic correspondence. Motivated 
classes include classes a, € Hdg” (X,), for some Hodge class a on a smooth 
projective variety X — B (where B is connected), such that for some regular value 
0 € B, ay € Hdg”* (Xp) is algebraic. 


Example 1. Let X be smooth complex projective, and let bx, := dim H?*(X, Q). 
Then the space 


box 


rAeiai ea Q) Cc H (Xx, Q) 2% Cc Hex (x? Q) 


is clearly a Hodge substructure which is of rank 1, hence generated by a Hodge 
class on X?*. This class is clearly an absolute Hodge class. Note that one can 
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make the same construction with odd degree cohomology, but in this case the 
existence of a polarization easily implies that the classes one gets are algebraic, 
or at least motivated. For this reason, by specializing to Fermat hypersurfaces, the 
class constructed above is motivated for all smooth hypersurfaces. 
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